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Preface  to  First  Edition 


The  importance  assigned  to  accuracy  in  basic  mathematics  courses  has,  initially, 
a  useful  disciplinary  purpose  but  can,  unintentionally,  hinder  progress  if  it  fosters 
the  belief  that  exactness  is  all  that  makes  mathematics  what  it  is.  Multivariate 
calculus  occupies  a  pivotal  position  in  undergraduate  mathematics  programmes  in 
providing  students  with  the  opportunity  to  outgrow  this  narrow  viewpoint  and  to 
develop  a  flexible,  intuitive  and  independent  vision  of  mathematics.  This  possi¬ 
bility  arises  from  the  extensive  nature  of  the  subject. 

Multivariate  calculus  links  together  in  a  non-trivial  way,  perhaps  for  the  first 
time  in  a  student’s  experience,  four  important  subject  areas:  analysis ,  linear 
algebra ,  geometry  and  differential  calculus.  Important  features  of  the  subject  are 
reflected  in  the  variety  of  alternative  titles  we  could  have  chosen,  e.g.  “Advanced 
Calculus”,  “Vector  Calculus”,  “Multivariate  Calculus”,  “Vector  Geometry”, 
“Curves  and  Surfaces”  and  “Introduction  to  Differential  Geometry”.  Each  of 
these  titles  partially  reflects  our  interest  but  it  is  more  illuminating  to  say  that  here 
we  study  differentiable  functions,  i.e. 

functions  which  enjoy  a  good  local  approximation  by  linear  functions. 

The  main  emphasis  of  our  presentation  is  on  understanding  the  underlying 
fundamental  principles.  These  are  discussed  at  length,  carefully  examined  in 
simple  familiar  situations  and  tested  in  technically  demanding  examples.  This 
leads  to  a  structured  and  systematic  approach  of  manageable  proportions  which 
gives  shape  and  coherence  to  the  subject  and  results  in  a  comprehensive  and 
unified  exposition. 

We  now  discuss  the  four  underlying  topics  and  the  background  we  expect — 
bearing  in  mind  that  the  subject  can  be  approached  with  different  levels  of 
mathematical  maturity.  Results  from  analysis  are  required  to  justify  much  of  this 
book,  yet  many  students  have  little  or  no  background  in  analysis  when  they 
approach  multivariate  calculus.  This  is  not  surprising  as  differential  calculus 
preceded  and  indeed  motivated  the  development  of  analysis.  We  do  not  list 
analysis  as  a  prerequisite,  but  hope  that  our  presentation  shows  its  importance  and 
motivates  the  reader  to  study  it  further. 

Since  linear  approximations  appear  in  the  definition  of  differentiable  functions, 
it  is  not  surprising  that  linear  algebra  plays  a  part  in  this  book.  Several-variable 
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calculus  and  linear  algebra  developed,  to  a  certain  extent,  side  by  side  to  their 
mutual  benefit.  The  primary  role  of  linear  algebra,  in  our  study,  is  to  provide  a 
suitable  notation  and  framework  in  which  we  can  clearly  and  compactly  introduce 
concepts  and  present  and  prove  results.  This  is  more  important  than  it  appears 
since  to  quote  T.  C.  Chaundy,  “notation  biases  analysis  as  language  biases 
thought  ”.  An  elementary  knowledge  of  matrices  and  determinants  is  assumed  and 
particular  results  from  linear  algebra  are  introduced  as  required. 

We  discuss  the  role  of  geometry  in  multivariate  calculus  throughout  the  text  and 
confine  ourselves  here  to  a  brief  comment.  The  natural  setting  for  functions  which 
enjoy  a  good  local  approximation  by  linear  functions  are  sets  which  enjoy  a  good 
local  approximation  to  linear  spaces.  In  one  and  two  dimensions  this  leads  to 
curves  and  surfaces ,  respectively,  and  in  higher  dimensions  to  differentiable 
manifolds. 

We  assume  the  reader  has  acquired  a  reasonable  knowledge  of  one-variable 
differential  and  integral  calculus  before  approaching  this  book.  Although  not 
assumed,  some  experience  with  partial  derivatives  allows  the  reader  to  proceed 
rapidly  through  routine  calculations  and  to  concentrate  on  important  concepts. 
A  reader  with  no  such  experience  should  definitely  read  Chapter  1  a  few  times 
before  proceeding  and  may  even  wish  to  consult  the  author’ s  Functions  of  Two 
Variables  (Chapman  and  Hall  1995). 

We  now  turn  to  the  contents  of  this  book.  Our  general  approach  is  holistic  and 
we  hope  that  the  reader  will  be  equally  interested  in  all  parts  of  this  book. 
Nevertheless,  it  is  possible  to  group  certain  chapters  thematically. 

Differential  Calculus  on  Open  Sets  and  Surfaces  (Chapters  1-4). 

We  discuss  extremal  values  of  real- valued  functions  on  surfaces  and  open  sets. 
The  important  principle  here  is  the  Implicit  Function  Theorem ,  which  links  linear 
approximations  with  systems  of  linear  equations  and  sets  up  a  relationship  between 
graphs  and  surfaces. 

Integration  Theory  (Chapters  6,  9,  11-15). 

The  key  concepts  are  parameterizations  (Chapters  5,  10  and  14)  and  oriented 
surfaces  (Chapter  12).  We  build  up  our  understanding  and  technical  skill  step  by 
step,  by  discussing  in  turn  line  integrals  (Chapter  6),  integration  over  open  subsets 
of  R  (Chapter  9),  integration  over  simple  surfaces  without  orientation  (Chapter 
11),  integration  over  simple  oriented  surfaces  (Chapter  12)  and  triple  integrals  over 
open  subsets  of  M3  (Chapter  14).  At  appropriate  times  we  discuss  generalizations  of 
the  fundamental  theorem  of  calculus,  i.e.  Green’s  Theorem  (Chapter  9),  Stokes’ 
Theorem  (Chapter  13)  and  the  Divergence  Theorem  (Chapter  15).  Special  attention 
is  given  to  the  parameterization  of  classical  surfaces,  the  evaluation  of  surface 
integrals  using  projections,  the  change  of  variables  formula  and  to  the  detailed 
examination  of  involved  geometric  examples. 
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IX 


Geometry  of  Curves  and  Surfaces  (Chapters  5,  7-8,  10,  16-18). 

r\ 

We  discuss  signed  curvature  in  R  and  use  vector- valued  differentiation  to 
obtain  the  Frenet-Serret  equations  for  curves  in  R  .  The  abstract  geometric  study 
of  surfaces  using  Gaussian  curvature  is,  regrettably,  usually  not  covered  in 
multivariate  calculus  courses.  The  fundamental  concepts,  parameterizations  and 
plane  curvature,  are  already  in  place  (Chapters  5,  7  and  10)  and  examples  from 
integration  theory  (Chapters  11-15)  provide  a  concrete  background  and  the 

r\ 

required  geometric  insight.  Using  only  curves  in  R  and  critical  points  of  functions 
of  two  variables  we  develop  the  concept  of  Gaussian  curvature.  In  addition,  we 
discuss  normal,  geodesic  and  intrinsic  curvature  and  establish  a  relationship 
between  all  three.  In  the  final  chapter  we  survey  informally  a  number  of  interesting 
results  from  differential  geometry. 

This  text  is  based  on  a  course  given  by  the  author  at  University  College,  Dublin. 
The  additions  that  emerged  in  providing  details  and  arranging  self-sufficiency 
suggest  that  it  is  suitable  for  a  course  of  30  lectures.  Although  the  different  topics 
come  together  to  form  a  unified  subject,  with  different  chapters  mutually 
supporting  one  another,  we  have  structured  this  book  so  that  each  chapter  is  self- 
contained  and  devoted  to  a  single  theme. 

This  book  can  be  used  as  a  main  text,  as  a  supplementary  text  or  for  self-study. 
The  groupings  summarised  above  allow  a  selection  of  short  courses  at  a  slower 
pace.  The  exercises  are  extremely  important  as  it  is  through  them  that  a  student  can 
assess  progress  and  understanding. 

Our  aim  was  to  write  a  short  book  focusing  on  basic  principles  while  acquiring 
technical  skills.  This  precluded  comments  on  the  important  applications  of  mul¬ 
tivariate  calculus  which  arise  in  physics,  statistics,  engineering,  economics  and, 
indeed,  in  most  subjects  with  scientific  aspirations. 

It  is  a  pleasure  to  acknowledge  the  help  I  received  in  bringing  this  project  to 
fruition.  Dana  Nicolau  displayed  skill  in  preparing  the  text  and  great  patience  in 
accepting  with  a  cheerful  “OK,  OK,”  the  continuous  stream  of  revisions,  cor¬ 
rections  and  changes  that  flowed  her  way.  Michael  Mackey’s  diagrams  speak  for 
themselves.  Brendan  Quigley’s  geometric  insight  and  Pauline  Mellon’s  sugges¬ 
tions  helped  shape  our  outlook  and  the  text.  I  would  like  to  thank  the  Third  Arts 
students  at  University  College,  Dublin,  and  especially  Tim  Cronin  and  Martin 
Brundin  for  their  comments,  reactions  and  corrections.  Susan  Hezlet  of  Springer 
provided  instantaneous  support,  ample  encouragement  and  helpful  suggestions  at 
all  times.  To  all  these  and  the  community  of  mathematicians  whose  results  and 
writings  have  influenced  me,  I  say — thank  you! 
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University  College  Dublin, 
Belfield,  Dublin  4, 
Ireland. 
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Fifteen  years  have  elapsed  since  the  first  edition  was  published  and  5  years  have 
gone  by  since  I  last  taught  a  course  on  the  topics  in  this  book.  It  is  nice  to  know  that 
Springer  still  believes  that  new  generations  of  teachers  and  students  may  still  be 
interested  in  my  approach  and  I  am  grateful  to  them  for  allowing  me  the  opportunity 
to  correct  some  errors,  to  revise  some  material,  and  to  pass  on  to  new  readers 
comments  of  previous  readers.  I  have,  I  hope,  maintained  the  style,  format,  general 
approach  and  the  results  of  previous  editions.  I  have  made  changes  in  practically  all 
chapters  but  the  main  changes  occur  in  the  final  three  chapters,  which  is  an  intro¬ 
duction  to  the  differential  geometry  of  surfaces  in  three-dimensional  space.  And  now 
some  important  information  which  was  not  sufficiently  stressed  in  earlier  prefaces: 
as  preparation  to  using  this  book  readers  should  have  completed  a  course  in  linear 
algebra  and  a  first  course  on  partial  differentiation.  Chapter  1  in  this  book  is  a 
summary  of  material  that  is  presumed  known  and  an  introduction  to  notation  that  we 
use  throughout  the  book. 

It  is  a  pleasure  to  thank  Michael  Mackey  for  his  continued  support  and  practical 
and  mathematical  help  in  preparing  this  edition.  Joerg  Sixt  and  Catherine 
Waite  from  Springer  have  been  supportive  and  efficient  throughout  the  period  of 
preparation  of  this  edition. 

Dublin,  Ireland  Sean  Dineen 

e-mail:  sean.dineen@ucd.ie 
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Chapter  1 

Introduction  to  Differentiable  Functions 


Summary  We  introduce  differentiable  functions,  directional  and  partial  derivatives, 
graphs  and  level  sets  of  functions  of  several  variables. 

In  this  concise  chapter  we  introduce  continuous  and  differentiable  functions  between 
arbitrary  finite  dimensional  spaces.  We  pay  particular  attention  to  notation,  as  appro¬ 
priate  notation  is  often  the  difference  between  simple  and  complicated  presentations 
of  several- variable  calculus.  Once  this  is  in  place  many  of  our  calculations  follow 
the  same  lines  as  in  the  one  dimensional  calculus.  We  do  not  include  proofs  but,  for 
readers  familiar  with  analysis,  we  provide  suggestions  that  lead  to  proofs  along  the 
lines  that  apply  in  the  one  variable  calculus. 

The  following  extremely  simple  example  illustrates  the  type  of  calculation  we 
will  be  executing  frequently  and  the  reader  should  practice  similar  examples  until 
they  become  routine  and  the  intermediate  step  is  unnecessary. 

Example  1.1  Let 


/  :  M3  — >  M,  f(x,  y,  z)  =  xey  +  y2z3 . 

df 

The  partial  derivative  of  /  with  respect  to  x,  —  or  fx,  is  obtained  by  treating  y 

dx 

and  z  as  constants  and  differentiating  with  respect  x  in  the  usual  one  variable  way. 
Thus  if  A  =  ey  and  B  =  y2z 3  then  f(x,  y,  z)  =  Ax  +  B  and 

df  d 

—  =  —(Ax  +  B)  =  A  =  ey. 
dx  dx 

Similarly  if  C  =  x  and  D  =  z3  then  f(x,y,z )  =  Cey  +  Dy2  and 

—  =  — ( Cey  +  Dy2)  =  Cey  +  2  Dy  =  xey  +  2  yz3, 
dy  dy 

and,  if  E  =  xey  and  F  =  y2,  then  f(x,  y,  z)  =  E  +  E z3  and 
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%  =  }(E  +  Fz 3)  =  3  Fz2  =  3  y2z2. 
dz  dz 

We  now  recall  concepts  and  notation  from  linear  algebra.  First  we  define  the  dis¬ 
tance  between  vectors  in  W1 .  This  will  enable  us  to  define  convergent  sequences,  open 
and  closed  sets,  continuous  and  differentiable  functions,  and  state  the  fundamental 
existence  theorem  for  maxima  and  minima. 

If  X  =  (jci  , . . . ,  xn)  g  W1  let  ||X||  =  (v2  +  •  •  •  +  x2)1/2  and  call  ||X||  the  length 
(or  norm )  of  X.  If  X  and  Y  =  (y i,  . . . ,  yn)  are  vectors  in  M77  then  \\X  —  F||  is  the 
distance  between  X  and  Y .  The  inner  product  (or  dot  product  or  scalar  product)  of 
X  and  y,  X  •  Y  or  ( X ,  Y ),  is  defined  as 


n 

(X,  Y)  =X-Y  =  ^xl7i. 

i= 1 


We  have  ||X||2  =  (X,  X)  and  two  vectors  X  and  Y  are  perpendicular  if  and  only 
if  their  inner  product  is  zero. 

For  1  <  j  <  n,  let  ey  =  (0,  . . . ,  1,  0,  . . .),  where  1  lies  in  the  j th  position.  The 
set  (e/)”=1  is  a  basis ,  the  standard  wraY  vector  basis}  for  M77 .  If  X  =  (vi ,  . . . ,  vn)  G 
M77  then 

n  n 


X  —  X\e\  +  X2^2  +  •  *  •  +  Xn^n 


/  =  1 


X(x’e/)e/- 

;'=i 


A  mapping  T  :  M77  — >  M777  is  linear  if 

7>X  +  £T)  =  aT(X)  +  bT(Y) 

for  all  X,  Y  g  M77  and  all  a,b  g  R.  For  1  <  i  <  m  and  1  <  j  <  n  let 
=  (T(e/),  e/).  If  X  =  (xi,  . . . ,  xn)  then,  interchanging  the  order  of  summa¬ 
tion,  we  obtain 


n 


n 


T(X)  =  T(YJXJel)=XxjT(eJ> 


7  =  1 

n  m 


7  =  1 


—  ^  Xj  (  ^  (e7 ) >  e/' ) e/' ) 

7  =  1  *  =  1 

m  /? 

z'  =  l  7  =  1 


e,- 


1  We  use  the  same  notation  for  the  standard  basis  in  Rn  and  Mm .  The  context  tells  the  dimension  of 
the  space  involved.  Otherwise  we  would  be  using  more  and  more  unwieldy  notation. 
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m  n 


=z 


i  =  1 


7  =  1 


This  shows  that  T (X)  =  A(X )  where  the  m  x  n  matrix  A  =  iPi,j)\<i<m,l<j<n 
operates  on  the  column  vector  X  by  matrix  multiplication.  We  may  now  identify 
the  space  of  linear  mappings  from  M77  into  Wn  with  the  space  of  m  x  n  matrices, 
Mw>w.  To  present  in  a  reasonable  form  the  product  rule  and  chain  rule  (see  below) 
we  identify  M77  with  Mw>i,  that  is  the  points  in  W1  are  considered  to  be  column 
vectors.  The  reader  should  however  note  that,  for  typographical  convenience,  this 
convention  is  often  ignored  and  points  in  M77  are  written  as  row  vectors.  However,  in 
taking  derivatives  the  correct  convention  should  be  followed  to  avoid  meaningless 
expressions. 

A  subset  U  of  M77  is  open  if  for  each  Xq  e  U  there  exists  s  >  0  such  that2 

{X  e  :  ||X  -  X0||  <  s}  C  U. 

A  subset  A  of  M77  is  closed  if  its  complement  is  open  and  a  set  B  is  bounded  if 
there  exists  MgR  such  that  \\x  ||  <  M  for  all  v  e  B.  Thus  all  points  in  an  open  set 
A  are  surrounded  by  points  from  A  while  any  point  that  can  be  reached  from  a  closed 
set  B  belongs  to  B.  A  crucial  role  in  all  aspects  of  calculus,  analysis  and  geometry  is 
played  by  sets  which  are  both  closed  and  bounded;  such  sets  are  said  to  be  compact. 

Example  1.2  The  closed  solid  sphere  {(x,  y,  z)  :  x2  +  y2  +z2  <  1}  and  its  boundary 
{(x,  y,  z)  :  x2  +  y2  +  z2  =  1}  are  both  compact  subsets  of  M3  while  the  solid  sphere 
without  boundary  {(x,y,  z)  :  x2  +  y2  +  z2  <  1 }  is  an  open  bounded  subset  of  M3 . 
A  line  is  a  closed  unbounded  set  while  every  open  subset  of  M3  is  a  union  of  open 
spheres. 

If  (Xk)™=l  is  a  sequence  of  vectors  in  M77  and  Y  e  M77  then  we  say  that  Xk 
converges  to  Y  as  k  tends  to  infinity  and  write  Xk  — >  Y  as  k  — >  oo  and 
lim^oo  Xk  =  Y  if 

\\Xk  —  Y ||  — >  0  as  i  — >  oo. 

Convergence  in  M77  is  thus  reduced  to  convergence  in  R.  Moreover,  if  Xk  = 
(if,  . . . ,  x„)  and  Y  =  (y\, ,  yn)  then 

lim  Xk  =  Y  lim  xf  =  yt,  1  <  i  <  n. 

k — >  oo  k — >  oo 

A  function  F:U  C  M77  — >  M777  is  continuous  at  Xo  e  U  if  for  each  sequence 
(X*)^  in  U 


2  The  e  chosen  will  depend  on  Xo  and  to  be  rigorous  we  should  indicate  this  dependence  in  some 

way,  e.g.  by  writing  sx 0.  This  would  lead  to  unnecessarily  complicated  notation.  We  hope  that  this 
simplification  will  not  be  the  source  of  any  confusion. 
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lim  Xk  =  X0^  lim  F(Xk)  =  F(X0). 

k — >  oo  k^oo 

When  this  is  the  case  we  write  limx^Xo  F(X)  =  F(X o).  If  F  is  continuous  at 
all  points  in  U  we  say  F  is  continuous  on  U. 

We  could  also  define,  in  different  ways,  a  length  function  on  MWjW,  for  instance 
by  identifying  Mm>w  with  Wnn .  Any  such  standard  definition  will  be  equivalent  to 
the  following:  if  Ak  g  Mm>w  for  all  k,  Ak  =  (ak .)ij  and  A  =  ( aij)ij  then 

lim  Ak  =  A  4=>  lim  ak  •  =  at  / 

k—>oo  k — >oc 

for  all  (/,  j),  1  <  i  <  m,  1  <  j  <  n.  We  may  now  define  for  positive  integers  /,  m 
and  ft  continuous  mappings  from  U  C  — >  Mm^. 

A  function  /:  A  C  M"  ^  I  has  a  maximum  on  A  if  there  exists  X\  e  A  such  that 
f(X)  <  f(X i)  for  all  X  g  A.  We  call  /(Xi)  the  maximum  of  /  on  A  and  say  that 
/  achieves  its  maximum  on  A  at  X\ .  The  maximum,  if  it  exists,  is  unique  but  may  be 
achieved  at  more  than  one  point.  A  point  X  i  in  A  is  called  a  local  maximum  of  /  on 
A  if  there  exists  8  >  0  such  that  /  achieves  its  maximum  onAH{I  :  \\X  —  X\\\  <  5} 
at  X\.  If,  in  addition,  f(X )  <  f(X i)  whenever  X  ^  Ii  we  call  X\  a  strict  local 
maximum.  Isolated  local  maxima  are  strict,  i.e.  if  for  some  8  >  0,  X\  is  the  only 
local  maximum  of  /  in  A  n  {X  :  \\X  —  X\\\  <5}  then  X\  is  a  strict  local  maximum. 
In  particular,  if  the  set  of  local  maxima  of  /  infinite  then  all  local  maxima  are  strict 
local  maxima.  The  analogous  definitions  of  minimum,  local  minimum  and  strict  local 
minimum  are  obtained  by  reversing  the  above  inequalities. 

Compact  sets  and  continuity  feature  in  the  following  fundamental  existence  the¬ 
orem  for  maxima  and  minima. 

Theorem  1.3  A  continuous  real-valued  function  defined  on  a  compact  subset  ofW 1 
has  a  maximum  and  a  minimum. 

The  practical  problem  of  finding  maxima  and  minima  often  requires  a  degree  of 
smoothness  finer  than  continuity  called  differentiability.  Continuity  and  differentia¬ 
bility  of  most  functions  we  encounter  can  be  verified  by  using  functions  from  R  into 
R,  addition,  multiplication,  composition  of  functions  and  linear  mappings. 

Definition  1.4  Let  F:U  c  M/7  — >  Mm,  U  open,  and  let  Xo  e  U.  We  say  that  F 
is  differentiable  at  Xo  £  U  if  there  exists  a  function  A  :  U  — >  Mm?„  which  is 
continuous  at  Xo  such  that 

F(X)  =  F(X o)  +  A(X)(X  -  X0)  (1.1) 


for  all  X  eU. 

If  f:(a,  b)  c  R  — >  R  is  differentiable  at  xo  e  (a,  b)  in  the  classical  sense,  that 
is  if 


f'(x  o)  =  lim 


fix)  -  fix o) 
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is  finite,  then  letting 


A  (  x  _  (/(*)  -  fix  o))/(x  -  x0),  if  X  ^  x0, 

W  l/'(*0).  if  x  =  x0, 

we  see  that  /  is  also  differentiable  in  the  sense  of  Definition  1.4.  By  using  (1.1)  we 
see  that  the  converse  is  also  true. 

The  function  A  in  Definition  1 .4  is  not  necessary  unique,  However,  if  the  matrix¬ 
valued  functions  A  and  B  satisfy  F(X)  =  F(X o)  +  A(X)(X  —  Xo)  and  F(X)  = 
F(X o)  +  B(X)(X  —  Xo)  and  both  are  continuous  at  Xo  then 

lim  A(X)  =  lim  B(X) 

X^Xq 


and  we  call  the  common  value  the  derivative  of  F  at  Xo  and  denote  it  by  both  DF(X o) 
and  F'(Xo).  If  /  is  a  scalar-valued  function  we  also  write  V/(X)  in  place  of  f'(X ) 
and  call  V/  the  gradient  of  /.  Gradient  is  just  another  word  for  slope  and,  in  one 
variable  calculus  the  derivative  is  the  slope  of  the  tangent  line  to  the  graph  of  the 
function.  Note  that  if  F:  W1  — >  Mm  then  F'(X)  is  an  m  x  n  matrix  (i.e.  the  order  is 
reversed)  and  is  identified  with  a  linear  mapping  from  R”  into  Rm .  The  term 

F(X o)  +  F\X 0)(X  -  X0) 

is  our  linear  approximation  to  F(X)  near  Xo. 

If  F  is  differentiable  then  F  is  continuous.  The  sum,  difference  and  scalar 
multiple  of  differentiable  functions  are  differentiable  and  the  product  of  scalar¬ 
valued  and  vector-valued  differentiable  functions  are  differentiable.  We  have  (see 
also  Exercise  1.7)  the  following  formulae  whenever  F,  G  :  U  C  — >  Mm, 

/  :  U  c  IT  — >  R  are  differentiable,  a,  b  e  R,  A  e  Rm,  T  :  M7Z  — >  Rm  is  linear 
and  (A  +  T)(X)  =  A  +  T(X)  for  all  X  e  Mm: 

(i)  (aF  +  bG)f(X)  =aF'(X)  +  bG'(X), 

(ii)  (/  •  G)r(X)  =  /(X)  •  Gr(X)  +  G(X)  o  V/(X), 

(iii)  (A  +  T)'(X)  =  T. 

Part  (ii)  above  is  the  product  rule  for  differentiation.  Note  that  •  denotes  scalar  mul¬ 
tiplication,  while  o  denotes  matrix  multiplication  of  the  m  x  1  matrix  G(X)  and 
the  1  x  n  matrix  V/(X).  Part  (iii)  tells  us  that  the  derivative  of  a  linear  function  is 
constant  and  equal  to  itself  at  all  points  while  the  derivative  of  a  constant  function  is 
0  (in  the  appropriate  space). 

The  composition  of  differentiable  functions  is  again  differentiable  and  the  chain 
rule ,  (1.2),  gives  the  derivative  of  the  composition  in  an  elegant  form.  Let 

F  :  U  (open)  c  —>  Rm,  G  :  V  (open)  c  Mm  R p 


6 


1  Introduction  to  Differentiable  Functions 


denote  differentiable  functions.  If  X  e  U  and  F(X)  e  V  then  (G  o  F)'(X)  exists 
and 


D(GoF)(X)  =  (GoF)'(X)  =  G'(F(X))oF'(X)  =  DG(F(X))o DF(X).  (1.2) 


If  /  :  U  cr  — >  R  is  differentiable  at  X  =  (xi , . . . ,  xn)  e  U  then  V/ (X)  is  a 
1  xn  matrix,  that  is  a  row  matrix  with  n  entries.  If  e*  6  R77  and  gi(x)  =  X  +  xe*  for 
all  x  e  R  then  gi  is  differentiable  and  g-(x)  =  [g-(v)](ei)  =  e*  at  all  x  g  R.  Since 

/  O  g;(x)  =  /(X  +  xe,0  =  f(x  1,  . .  .,Xi-i,Xi  +x,...,xn) 


the  composition  f  o  gi  :  R  — >  R  is  differentiable  and 


d(  f  O  gi )  ./Gl  ?  •  •  •  5  Xi  —  1 ,  Xi  +  X,  .  .  .  ,  Xn)  f  {x\  ,  .  .  •  ,  Xi  —  l ,  Xi ,  .  .  .  ,  Xn) 

- - - (X)  =  lim  - 

ax  x 

=  ^-(X)  =  v/(g(0))(g{(0))  =  V/(X)(e,). 

OX; 


This  shows  that 


V/(X)  =  Df  (X)  =  /'(X)  = 


) 


and,  if  go  =  2j=i  (°jej  G 


n 


n 


Vf(X)(a>)  =  V/(X)(X©jey)  = 

./  =  !  7=1  7 


If  F:G  C  M77  — >  M777  then  F(X)  has  m  coordinates  /i(X),  . . . ,  /m(X)  and  we 
often  write  F  =  (/i ,  . . . ,  fm)  where  each  ft  is  a  real- valued  function  of  n  variables. 
Hence,  F  =  fiei  and  F  is  differentiable  if  and  only  if  each  ft  is  differentiable. 
If  co  =  cojej  e  ^ n  then, 


n 


m 


n 


F'(X)(£ cojej)  =  C-Z  I 

7  =  1  i= 1  7  =  1  *  =  1  7  =  1 


m 


dft 

CDi  —  (X) 


J 


9x 


7 


e,- 


,  dfi 

and  hence,  for  1  <  j  <  /i,  1  <  /  <  ft,  we  have  (F  (X)e,-,e/)  =  - (X)  and 

9x 


9x 


'j 


(X)  is  the  (/,  7)  entry  in  the  m  x  n  matrix  DF(X)  =  F7(X), 


j 


If  v  g  M77  then  the  function  G  :  x  g  R  — >  X  +  xv  g  M77  is  differentiable 
at  any  X  g  M77  and,  as  above,  G7(x)  =  G7(x)(ei)  =  v  for  all  x  g  R.  If  F  : 
G  (open)  g  M77  — >  M777  is  differentiable  at  X  then  (FoG)7(0)  =  F7(G(0))  oG7(0)  = 
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F'(X)(v).  We  write  DyF(X)  and  FV(X)  in  place  of  D(F  o  G)( 0)  and  call  DVF(X) 
the  directional  derivative  of  F  at  X  in  the  direction  v.  Since 

F(X  +  x\)  =  F(G(x))  =  F(G(0))  +  A(x)x  =  F(X)  +  A(x)x 

where  lirn^o  A(x)  =  DyF(X)  we  have 

F(X  +  xv)  -  F(X) 


DVF(X)  =  lim 


(1.3) 


dF 

If  v  =  e/  we  write  — (X)  and  FXi  (X)  in  place  of  DtiF(X)  and  call  these  the 

dxi 

(first-order)  partial  derivatives  of  F. 

If  F  is  differentiable  at  X  then  all  (first-order)  directional  and  partial  derivatives 
of  F  exist  and  we  have  shown 


F'(X)  = 


V/i(X) 


V/m(X) 


... 

9/i 

dxn 

dfm(X)  ... 

V  9xi 

dfm 

dxn 

(X) 


\ 


(X) 


=  (Fxl(X),...,FXn(X)) 


J 


If  v  =  (vi , . . . ,  vn)  gR"  this  shows  that 


n 


n 


dF 


DyF(X)  =  F'Q. 0(V)  =  Y,v<FXi(X)  =  ^Vi—(X). 


i  =  1 


i  =  1 


If  all  first-order  partial  derivatives  of  F  exist  and  are  continuous  then  F  is  differ¬ 
entiable.  This  criterion  enables  us  to  see,  literally  at  a  glance,  the  sets  on  which  most 
functions  have  derivatives,  partial  derivatives  and  directional  derivatives.  Consider, 
for  instance,  the  function 


F(x,  y,  z)  =  (sin(xyz),  x2  -  y2 ,  exp(xy)) 


We  have 


F(x,  y,  z)  =  sin(xjz)ei  +  (x2  -  y2)e2  +  exp(xy)e3 

and  it  suffices  to  consider  separately  the  three  R- valued  functions  sin(vyz),  x2  —  y2 
and  exp(vy).  Since  linear  functions  are  continuous  and  the  composition  and  product 
of  continuous  functions  are  continuous  we  see,  on  calculating  the  partial  derivatives, 
as  in  Example  1.1,  that  all  the  first  order  partial  derivatives  exist  and  are  continuous. 
Hence  F  is  differentiable. 
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Higher-order  directional  and  partial  derivatives  are  defined  in  the  usual  way,  i.e. 
by  repeated  differentiation.  If  F  :  U  C  M/7  — >  Rm  is  differentiable  then  each  row 
in  F'(X )  corresponds  to  a  coordinate  function  of  F  while  each  column  corresponds 
to  an  (independent)  coordinate  variable. 

Example  1.5  Let  /PM4  ->  M3  be  defined  by 

F(x,  y,  z,  w)  =  (x2y,  xyz,  x2  +  y2  +  zw2). 


Then  F  =  (/j,  f2,  fo)  where  f\  (x ,  y,  z,  w)  =  x2y,  f2 (x ,  y,  z,  w)  =  xyz  and 
/3(x,y,z,w)  =  x2  +  y2  +  zw2.  Moreover,  V/j(x,  j,  z,  w)  =  (2xy,  x2,  0, 0), 
Vf2(x,  y,  z,  w)  =  (yz,  xz,  xy,  0)  and  V/3(x,  y,  z,  w)  =  (2x,  2 y,  w2,  2 zw).  Hence 


/ 2 xy  x2  0 
F’ix.  y,  z,w)  =  I  yz  xz  xy 

\  2x  2y  w2 

If  X  =  (1, 2,  —  1,  —2)  and  v  =  (0,  1,  2,  —2)  then 


/4  1 

DVF(X)  =  F'(X)  orv  =  I  —2  -1 

V  2  4 


0  0\ 
2  0  I 
4  4/ 


V-2/ 


Associated  with  any  function  F:U  C  Pi"  — >  R”!  we  have  two  types  of  sets  which 
play  a  special  role  in  the  development  of  the  subject — graphs  and  level  sets.  The 
graph  of  F  is  the  subset  of  P:"+m  defined  as  follows 


graph  (F)  =  {(X,  Y)  :  X  e  U  and  Y  =  F(X)} 

=  {(X,F(X))  :X  e  U}. 

If  C  =  (ci , . . . ,  cm)  is  a  point  in  R'"  we  define  the  level  set  of  F,  F~l(C),  by  the 
formula 

F~l(C)  =  {X  e  U  :  F(X)  =  C}. 


In  terms  of  the  coordinate  functions  offwe  have 


F(X)  =  C  fi(X)  =  ci  for  i  = 


F~\C)  =  f|{X  e  U  :  MX) 

i= 1 


m 

= = n  f-\Ci). 


i=i 


and  hence 
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Fig.  1.1 


Thus  a  level  set  of  a  vector-valued  function  is  the  finite  intersection  of  level 
sets  of  real- valued  functions.  This  is  frequently  useful  in  arriving  at  a  geometrical 
interpretation  of  level  sets  as  the  following  example  shows. 

Example  1.6  Let  F(x ,  y,  z)  =  ( x 2  +  y2  +  z2  —  1,  2x2  +  2 y2  —  1).  We  have  F  = 
(/l,  fi)  where  /i(x,  y,  z)  =  v2+y2+z2- 1  and  f2(x,  y,  z)  =  2v2+2y2- 1.  The  set 
//” 1  (0)  is  a  sphere  of  radius  1  while  /2_ 1  (0)  is  a  cylinder  parallel  to  the  z-axis  built  on 

a  circle  with  centre  the  origin  and  radius  1  / \fl.  If  0  =  (0,  0)  is  the  origin  in  M2  then 

^-1(0)  =  /r1(0)n/2-1(0) 

is  the  intersection  of  a  sphere  and  a  cylinder  in  M3  (Fig.  1.1a). 

For  this  particular  example  we  obtain  more  information  by  solving  the  equations 
fi(x,  y,  z)  —  fiix,  y,  z)  =  0.  We  have  x2  +  y2  =  1  —  z2  =  1/2.  Hence  z2  —  1/2, 
z  =  ±l/\/2  and  the  level  set  consists  of  two  circles  on  the  unit  sphere  (Fig.  1.1b). 

The  relationship  between  graphs  and  level  sets  plays  an  important  role  in  our 
study.  The  easy  part  of  this  relationship — every  graph  is  a  level  set — is  given  in  the 
next  example  while  the  difficult  part — every  (regular)  level  set  is  locally  a  graph — is 
the  implicit  function  theorem  (Chap.  2). 


Example  1.7  Let  F:U  C  M/7  Mm.  We  define  G:U  x  Mm  — >►  Mm  by  G(X ,  Y) 
=  F(X)  —  Y.  If  0  is  the  origin  in  Mm  then 

(X,  y)  e  G“‘(»)  G(X,Y)  =  0 

«=►  F(X)-Y=  0 

(X,  Y)  e  graph  (F). 


Hence  G  1  (0)  =  graph  (F)  and  every  graph  is  a  level  set. 
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Exercises 


1.1.  Sketch  the  following  sets 


(a) 

(b) 

(c) 

(d) 

(e) 


x  y  z 

■  (x,  y,  z)  :  <  1 

az  bz  cz 

{(x,y ,z)  ■ x2  +  y 2  =  z2} 

{(*,  y,  z)  :  x  >  0,  y  >  0,  z  >  0} 

{(x,  y,  z)  :  x2  +  y2  +  z2  -  2z  =  0} 

{(x,  y,  z)  :  x2  +  y2  +  z2  -  4z  =  0,  x2  +  y2 


From  your  sketches  determine  which  of  the  sets  are:  open,  closed,  bounded, 
compact. 


1.2.  Find  all  first-order  partial  derivatives  of 


(a)  f{x,  y,  z)  =  (z2  +  x2)  log(l  +  x2y2) 

(b)  g(x,  y,z)  =  xytan_1(xz) 

sin(x2  +  y2  +z2  +  w2) 

(c)  h(x,  y,  z,  w)  = - — — - -» - 

1  +  (x  -  yY 


1.3.  If  F{x,  y,  z,  w)  =  (x2  —  y2,  2 xy,  zx,  z2w2x2)  and  v  =  (2,  1,  —2,  —1)  find 
F'i  1,  2,  -1,  -2)  and  DVF(  1,  2,-1,  -2). 

1.4.  Let  fix,  y,  z)  =  x2  —  xy  +  yz 3  —  6  z.  Find  all  points  (x,  y,  z)  such  that 
V/(x,  y,  z)  =  (0,  0,  0). 

1.5.  If  fix,  y,  z)  =  x2ev  and  g(x,  y,  z)  =  y2exz  find  V/,  Vg  and  V(/g).  Verify 
that  V(/g)  =  fVg  +  gVf. 

1.6.  Let  P :  (a,  Z?)  c  R.  — >  R".  Show  that 


^(||/>(t)||2)  =  2P(f)  o  p'it)  =  2 (Pit),  P\t )) 


and  deduce  that  if  HP  (Oil  does  not  depend  on  t  then  P\t)  _L  P(t). 

1.7.  If  Fix,  y,  z)  =  (x2,  y2  +  z2,  xyz)  and  G(x,  y,  z)  =  (ex ,  y2  -  z2,  xyz)  find 
F'ix,  y,  z)  and  G'ix,  y,  z).  Let  H(x,  y,  z)  =  (Fix,  y,  z),  G(x,  y,  z))  where 
(  ,  >  is  the  inner  product  in  R3.  Find  VH  (x,  y,  z.)  and  verify  that 


VH(x,  y,  z)  =  G(x,  y,  z)  °  F'ix,  y,  z)  +  F(x,  y,  z)  o  G\x,  y,  z). 
1.8.  If  fix,  y,  z)  =  (x2  +  y2  +  z2)-1/2  show  that 

jx,y,z) 

(x2  +  y2  +  z2)3/2 


V/(x,  y,  z)  = 
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1.9.  If  F:U  C  M/7  — >  Mm  is  differentiable  and  F(P)  ^  0  show  that  ||F||  is 
differentiable  at  the  point  P .  If  v  e  W1  show  that 

(DyF,  F) 

Vv(||F||)(P)=  *  ,  ;( P ). 

F 


1.10.  Use  Exercise  1.9  to  give  another  proof  of  Exercise  1.8,  i.e.  first  find  Dei 
for  i  =  1,  2,  3. 


1.11.  If 


(x,  y,  z) - >(xyz,  x2  +  y2,  x2  -  y2,  z2) 

{u,  V,  w,  t) - >(w2  +  v2,  u2  —  v2 ,  w2  —  t2 ,  w2  +  t2) 


G  =  (G i,  G2,  G 3,  G4)  and  H  =  (Hi,  H2,  #3,  H4)  =  G  o  F  verify  that 


3H2  dG  2  du  dG2  dv  dG2  dw  dG2  dt 

dx  du  dx  dv  dx  dw  dx  dt  dx 

directly  and  also  by  using  H'  =  Gr  o  F' . 

1.12.  If  the  function  f:W 7  — >  R  satisfies 


0 


show  that  h(x  ,xn)  =  /(e*1 , . . . ,  tXn )  satisfies 


1.13.  Let  f\W  — >►  R  be  differentiable,  let  X  e  M/7  and  =  (Z\vi,  . . . ,  Axn)  e 
M/7.  Show  that  there  exists  g:M?n  —>  R  such  that 


f(X  +  AX)  =  f(X)  +  YJ 

i  =  1 


(X)Z^+g(X,zAX)||^X|| 


and  g(X,  zAX)  ->  0  as  ->  0. 

1.14.  Let  f(x,  y,  z)  =  x2y 2 +y2z2+xyz.  By  using  the  previous  exercise  and  the  val¬ 
ues  of  /  and  its  first-order  derivatives  at  (1,  1,  1)  estimate  /( 1.1,  1.05,  0.95). 
Find  the  error  in  your  approximation  and  the  error  as  a  percentage  of  /  ( 1 ,  1 ,  1 ) . 

1.15.  Identify  geometrically  and  sketch  the  level  set  E_1(C)  where  F(x,  y,  z)  = 
(z2  —  x2  —  y2,  2x  —  y)  and  C  =  (1,  2). 
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1.16.  If  A  is  a  subset  of  W2  show  that  A  is  closed  if  and  only  if  for  all  Xo  e  W2  and 
(Xk)?=l  C  A,  lim^^oo  Xk  =  X0  implies  X0  €  A. 

1.17.  If  A  is  a  subset  of  W2  show  that  A  is  open  if  and  only  if  for  each  Xo  e  A  and 
(X,)£i  C  M",  lim^oo  Xk  =  Xo  there  exists  a  positive  integer  ko  such  that 
Xk  e  A  for  all  k  >  ko. 

1.18.  Show  that  the  sum,  product  and  quotient  of  continuous  (respectively  differen¬ 
tiable)  functions  are  continuous  (respectively  differentiable)  and  that  differen¬ 
tiable  functions  are  continuous. 

1.19.  Show  that  the  level  set 

y6  +  y2x 2  +  x2  +  y2  +  1 6z2  —  8  vyz  —  2xy  =  51 

is  a  compact  subset  of  M3 . 

1.20.  If  0  :  M3  — >►  M3  is  differentiable  and  F  :  M3  — >  M4  is  defined  by  F(x,  y,  z)  = 
(x,y,z,(p(x,y,z))  find  F' . 

1.21.  If  A  is  a  symmetric  m  xn  matrix  and  X  and  Y  are  eigenvectors  corresponding 
to  different  eigenvalues  show  that  (X,  Y)  =0. 


Chapter  2 

Level  Sets  and  Tangent  Spaces 


Summary  Using  systems  of  linear  equations  as  a  guide  we  discuss  the  significance 
of  the  implicit  function  theorem  for  level  sets.  We  define  the  tangent  space  and  the 
normal  space  at  a  point  on  a  level  set. 


We  shall  be  concerned  with  many  different  aspects  of  surfaces,  level  sets  and  graphs  in 
this  book.  In  this  chapter  we  consider  the  role  of  differentiability  in  the  local  structure 
of  level  sets.  By  considering  the  linear  approximation  of  differentiable  functions  and 
standard  results  on  solving  systems  of  linear  equations  we  see  that  level  sets  are 
locally  graphs. 

Let  F :  U  c  R”  — >  Mm,  U  open,  F  =  (f\ ,  . . . ,  /m),  C  =  (c\ , . . . ,  cm)  e  M.m .  We 
suppose  that  F  is  differentiable  and  consider  the  level  set  F~l  (C)  =  n-Li  ./r'fe), 
i.e.  the  points  (x\,  . . . ,  xn)  e  U  which  satisfy  the  equations 


f  \  (-M  >  •  •  •  5  %n )  —  £  1 

;  (2.D 

fm  (x  1 ,  .  .  .  ,  Xn )  —  Cm . 

We  have  n  unknowns,  x\ ,  . . . ,  xn  and  m  equations.  If  each  f  is  linear  we  have  a  system 
of  linear  equations  and  the  rank  of  the  matrix  of  coefficients  gives  information  on 
the  number  of  linearly  independent  solutions  and  procedures  on  how  to  identify  a 
complete  set  of  independent  variables.  The  implicit  function  theorem  says  that  this 
process  is  also  valid  locally  for  differentiable  functions.  The  key,  of  course,  is  the  fact 
that  differentiable  functions,  by  definition,  enjoy  a  good  local  linear  approximation. 

Fix  a  point  P  e  F~l{C)  and  suppose  X  e  W1  is  close  to  0  and  P  +  X  e  F~l{C). 
Since  F  is  differentiable, 

F(P  +  X)  =  F(P )  +  F'(P)(X )  +  G(X)(X) 


where  G  :  U  — »  Mm>„  and  G(X)  ->  0  as  X  0.  Since  F(P  +  X)  =  F(X)  =  C, 

F'(P)(X)  %  0 
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(where  ^  denotes  approximately  equal). 

We  assume  from  now  on  that  n  >  m.  We  thus  have  something  very  close  to  the 
following  system  of  linear  equations 


dfi 

dx\ 


d  fm 
dx\ 


(P)X  1  + 

(P)x  1  + 


The  matrix  of  coefficients  of  this  system  of  linear  equations  is 


1  <i<m 
1  <j<n 


(2.2) 


i.e.  F'(P).  From  linear  algebra  we  have 


Rank  (Ff(P))  =  m 


the  m  rows  of  F'(P)  are  linearly  independent 
4=>  there  exist  m  linearly  independent  columns 
in  F\P) 


F\P)  contains  m  columns  such  that  the 
associated  m  x  m  matrix  has  non-zero 
determinant 

4=>  the  space  of  solutions  for  the  system  (2.2) 
is  n  —  m  dimensional. 


Moreover,  if  any  of  these  conditions  are  satisfied  and  we  choose  m  columns  which 
are  linearly  independent  then  the  variables  corresponding  to  the  remaining  columns 
can  be  taken  as  a  complete  set  of  independent  variables  and  the  full  set  of  solutions 
coincides  with  the  graph  of  a  function  of  the  independent  variables.  If  the  above 
conditions  are  satisfied  we  say  that  F  has  full  or  maximum  rank  at  P. 

As  a  simple  example  consider 


2x  —  y  +  z  =  0 
y  —  w  =  0 


with  matrix  of  coefficients 
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The  2  x  2  matrix  obtained  by  using  the  first  two  columns  is  lo  ,J  and  this  has 

determinant  2^0.  Hence  A  has  rank  2  and  the  two  rows  are  linearly  independent. 
Since  the  first  two  columns  are  linearly  independent  we  can  take  the  remaining  two 
variables,  z  and  w,  as  the  independent  variables.  We  have  y  =  w,  2x  =  y  —  z  =w  —  z 
and  so  {((w  —  z)/ 2,  w,  z,  w)  :  z  e  R,  w  e  M}  is  the  solution  set  for  this  system  of 
equations.  We  can  write  this  in  the  form 


{{4>(z,  w),  z,  w)  :  (z,  w)  e  M2} 

where  0(z,  w)  =  ((w  —  z)/ 2,  w)  and  in  this  format  the  solution  space  is  the  graph 
of  the  function  0  (see  Example  1.7). 

Note  that  columns  1  and  3  are  not  linearly  independent,  since  the  corresponding 

(2  1\ 

2x2  matrix  I  1  has  zero  determinant,  and  we  cannot  choose  y  and  w  (the 

remaining  variables)  as  the  independent  variables. 

Assuming  that  the  rows  of  F'(P)  are  linearly  independent  is  equivalent  to  requir¬ 
ing  that  {V/i  (P),  ... ,  V/m(P)}  are  linearly  independent  vectors  in  Wl.  The  implicit 
function  theorem  says  that  with  this  condition  we  can  solve  the  non-linear  system  of 
Eq.  (2.1)  near  P  and  use  the  same  method  to  identify  a  set  of  independent  variables. 
The  hypothesis  of  a  good  linear  approximation  in  the  definition  of  differentiable 
functions  implies  that  the  systems  of  Eqs.(2.1)  and  (2.2),  are  very  close  to  one 
another. 

We  now  state  without  proof  the  Implicit  Function  Theorem. 

Theorem  2.1  (Implicit  Function  Theorem)  Let  F :  U  C  M/7  — >►  Mm,  m  <  n,  denote 
a  differentiable  function,  let  P  e  U  and  suppose  F(P)  =  C  and  rank  ( F'(P ))  =  m 
(for  convenience  we  suppose  that  the  final  m  columns  of  F'(P)  are  linearly  indepen¬ 
dent).  If  P  =  (pi,  ...,Pn)  let  Pi  =  (pi,  .  .  .  ,  Pn-m)  and  P2  =  (Pn-m+1,  •  •  •  ,  Pn) 
so  that  P  =  (Pi,  Pf).  Then  there  exists  an  open  set  V  in  Wl~m  containing  P\,  a 
differentiable  function  0 :  V  —>  Mm,  an  open  subset  W  of  U  containing  P  such  that 
(j)(P\)  =  P2  and 


F~l(C )  n  W  =  ■  (X,<I>(X))  :  X  ev\  =  graph  (</>). 


Thus  locally  every  level  set  is  a  graph  (Fig.  2.1). 


Example  2.2  Fet  F(x i,  x2,  X3,  xf)  =  (x\x2,  x2x^).  We  have 

.  /  V/i(vi,  x2,  X3,  X4)  \  (x2  xi  0 

F  (xi,x2,  X3,  X4)  =  \  1  =  1 

\V/2(^L^2,^3,^4)/  \°  0  x4 


0 


4  2X3X4 


) 


where  fi(xi,  x2,  X3,  xf)  =  x\x2  and  f2(xi ,  x2,  X3,  xf)  =  X3x\.  Consider  the  level 
set  F~l( 2,  —4)  and  the  point  P  =  (1,2,  —1,2)  e  F~l( 2,  —4).  We  have 
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Fig.  2.1 


Pi  Xi  Mn-m 


F\  1,2, -1,2) 


/2  1  0  0  \ 
\0  0  4  -4/' 


If  of (2,  1,  0,  0)  +  /3(0,  0,  4,-1)  =  (0,  0,  0,  0)  then  (2a,  a,  4f>,  ~/3)  =  (0,  0,  0,  0) 
and  hence  a  =  f  =  0.  This  implies  that  the  rows  of  F'(\,  2,  —  1,  2)  are  linearly 
independent  and  F'(P )  has  rank  2.  This  can  be  seen  even  more  rapidly  by  finding  a 
2x2  submatrix  with  non-zero  determinant,  e.g.  if  we  use  columns  2  and  3  we  get  the 

matrix  c  :>  with  determinant  4  ^  0.  It  is  easily  checked  that  the  following  pairs 

of  columns  are  linearly  independent  (1,  3),  (1,  4),  (2,  3)  and  (2,  4)  while  the  pairs 
(1,2)  and  (3,  4)  are  not  linearly  independent.  Since  columns  1  and  4  are  linearly 
independent  we  know  that  the  variables  %2  and  V3  can  be  chosen  as  a  complete  set  of 
independent  variables.  Thus  we  know  that  x\  and  X4  can  be  expressed  as  functions 
of  X2  and  V3  near  the  point  (1,2,  —1,2).  The  implicit  function  theorem  is  important 
because  it  tells  us  that  certain  functions  exist  even  though  it  does  not  show  how 
to  find  them.  In  general,  we  would  have  to  solve  the  system  of  Eq.  (2.1)  to  find 
these  functions  and  this  may  often  be  extremely  difficult  or  even  impossible  in  any 
reasonable  fashion.  In  our  rather  simple  situation  we  have  the  two  equations 


X\X2  =  2 
x^x\  —  —4 

and  can  find  a  solution.  We  have  x\  =  2/x2  and  since  (x\,  X2,  *3,  X4)  is  close  to 
(1,  2,  —  1,  2)  we  have  X2  close  to  2  and  the  natural  domain  for  X2  from  this  equation 
is  X2  >  0.  We  have  x\  =  — 4/^3  and  since  V3  is  close  to  —1  we  take  V3  <  0.  Hence 
—41x2  is  positive  and  X4  =  ±^/—4/x2.  Since  X4  is  close  to  2  we  take  the  positive 
square  root.  Thus  the  function  0,  whose  existence  is  foretold  by  the  Implicit  Function 
Theorem,  has  the  form 

l  2 

<; P(x2,X3 )  =  I  — ,  + 

\X2 
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on  the  open  set  V  =  {(x2,  x$)  :  X2  >  0,  V3  <  Oj.  After  a  rearrangement  of  the 
variables  we  get 


:  X2  >  0,  V3  <  0  ► 

and,  as  expected,  we  have 


An  examination  of  the  equations  x\X2  =  2  and  X3X2  =  —4  shows  that  it  is  not 
possible  to  find,  say  x$,  as  a  function  of  x\  and  X2  and  thus  we  cannot,  as  expected, 
use  x\  and  X2  as  the  independent  variables. 

Example  2.3  Given  the  equations 

x2  —  y2  +  u2  +  2v2  =  1  (2.3) 

x2 +  y2  -u2  -v2  =  2  (2.4) 


we  wish  to  find  all  (x,  y,  u,  v)  such  that  near  (x,  y,  u,v),  u  and  v  can  be  expressed 

du  dv 

as  differentiable  functions  of  v  and  y  and  we  also  wish  to  find  —  and  —  in  terms 

dx  dx 

of  v,  y,  u  and  v. 

Let  F(x,  y,  u,  v)  =  (x2  —  y2  +  u2  +  2v2,  x2  +  y2  —  u2  —  v2).  Then  (. x ,  y,  u,  v) 
satisfies  (2.3)  and  (2.4)  if  and  only  if  F(x,  y,  u,  v)  =  (1,2),  i.e.  if  and  only  if 
(x,  y,  u ,  v)  g  F_1((l,  2)).  We  have 


F\x,  y,  u,  v) 


/  2x 

-2  y 

2  u 

4v  \ 

\2x 

2  y 

—2  u 

— 2v  ) 

We  require  v  and  y  as  a  complete  set  of  independent  variables  and  so  must  have 
linear  independence  of  the  third  and  fourth  columns.  Hence  we  wish  to  find  the 

2  u  4v 
—2  u  — 2v 

point  (x ,  y ,  u ,  v)  satisfying  (2.3)  and  (2.4)  with  u  and  v  both  non-zero  will  be  suitable. 
du  dv 

To  compute  —  and  —  we  could  apply  the  chain  rule  to  the  equation 


points  (x,  y,  u,  v)  such  that  det 


( 


) 


=  4 mv  7^  0.  This  implies  that  any 


dx 


d}’ 


F(x,  y,  u(x,  y),  v(x,  y))  =  (1,  2) 
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or  just  use  (2.3)  and  (2.4)  which  now  read 

x2  -  y2  +  u(x,  y)2  +  2v(x,  y)2  =  1 
x2  -\-  y2  —  u(x,  y)2  —  v(x,  y)2  =  2. 

On  differentiating  we  get 


2x  +  2u(x,y)  +4  v(x,y)  -0 

dx  dx 

(2.5) 

du  dv 

2x  2  u(x,y)  2  v(x,y)  -0. 

dx  dx 

(2.6) 

The  method  of  differentiation  used  to  obtain  Eqs.  (2.5)  and  (2.6)  is  often  called 
“implicit  differentiation”  especially  in  one- variable  calculus  where  level  sets  of  the 
form  f(x,y )  =0  are  considered. 

du  dv 

These  are  just  two  linear  equations  in  —  and  —  which  are  easily  solved  to  give 

dx  dx 


du  3x  dv  —2x 

—  =  —  and  —  =  . 

dx  u  dx  v 

Notice  how  we  need  u  and  v  to  be  both  non-zero.  In  most  cases  of  this  type  it  is 
not  possible  to  find  explicit  formulae  for  u  and  v  by  solving  equations  similar  to 
(2.3)  and  (2.4) — hence  the  implicit  in  the  implicit  function  theorem.  So  although  in 
general  we  cannot  find  explicit  formulae  for  the  dependent  variables  in  terms  of  the 
independent  variables  we  can  find  the  partial  derivatives  in  terms  of  the  independent 

and  dependent  variables.  We  choose  this  particular  example  because  we  are  able  to 

du  dv  ?  ? 

verify  our  formulae  for  —  and  — .  Adding  (2.3)  and  (2.4)  gives  2x  +  v  =  3  and 

dx  dx 

hence  v  =  ±(3  —  2x2)  ^2  where  we  take  the  appropriate  sign  depending  on  the  value 
of  v.  We  have 


dv 

dx 


1 

=b-(3 

2 


2x2r1/2(-4x) 


—2x 

±(3-2x2)1/2 


—2x 

v 


Subtracting  (2.4)  from  (2.3)  gives 

—2y2  +  2u2  +  3v2  =  -1  =  —2y2  +  2 u2  +  3(3  -  2x 2) 


i.e.  2 u2  =  2y2  +  6x2  —  10  and  u  =  ±(3x2  + 

—  =  ±-(3x2  +  y2-  5)-i/2  •  6x  = 
dx  2  * 


y2  —  5)1/2.  Hence 
3x 

'  ±(3x2  +  y2  —  5)1/2 


3x 

u 


and  this  agrees  with  our  earlier  calculation. 
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We  now  return  to  the  general  situation  and  let 

F  =  (fi, . . . ,  fm) :  U  (open)  C  M"  ^  Mm 

denote  a  differentiable  function  which  has  full  rank  at  the  point  P  in  U .  Let  F(P)  = 
C.  We  have 

F_1(C)  =  {X  :  F(X)  =  C}  =  {X  :  F(P  +  X  -  P)  =  C] 

=  {X  :  F(P )  +  F'(P)(X  -  P)  +  G(X)(X  -  P)  =  C] 

%  {X  :  F\P)(X  -  P)}  =  0} 

=  {X  :  <V/KP),  X-P)=0,i  =  l,...,m} 

where  G  :  U  — >  Mm,n  and  G(X )  — >  0  as  X  — >  P . 

The  set 

{X  e  :  F\P)(X  -  P)  =  0}  =  P  +  {X  e  :  F\P)(X)  =  0} 

is  the  closest  linear  approximation  to  F~{  (C)  near  P  and  we  call  it  the  tangent  space 
to  F~l{C)  at  P.  Since 

f'(P)  =  (^/L(P))i  . 

\OXj  )  1  <i<m 
1  <j<n 

the  set  of  all  X  satisfying  the  equation  F'(P)(X)  =  0  is  precisely  the  set  of  all 
solutions  {x\,  . . . ,  xn]  of  the  system  of  homogeneous  linear  equations  (2.2)  that  we 
encountered  earlier  and,  as  F  has  full  rank  at  P,  this  space  of  solutions  forms  an 
(n  —  m) -dimensional  subspace  of  W1.  Since 

{X  e  R"  :  F'(P)(X)  =  0}  =  {X  e  M"  :  (Vfi(P),X)  =  0,  i  =  1, . . . ,  m] 

the  tangent  space  consists  of  the  vectors  which  are  perpendicular  to  the  gradients  of 
the  component  functions  transferred  to  the  point  P  (see  Fig.  2.2).  Normal  vectors 
are  perpendicular  to  tangent  vectors  and  it  is  thus  natural  to  define  the  normal  space 
to  the  level  set  at  P  as 


m 


p  +  4 


J^XiXMP)  :  Xi 

i= 1 


e  R  ► . 


If  the  tangent  space  or  normal  space  is  two-dimensional  we  use  the  term  tangent 
plane  and  normal  plane  respectively  and  if  it  is  one-dimensional  we  use  tangent  line 
and  normal  line  respectively.  The  tangent  space  and  normal  space  are  both  translates 
of  vector  subspaces  of  M/7  to  the  point  P .  The  tangent  space  is  the  subspace  which 
fits  closest  to  the  level  set  of  F  at  P  while  the  normal  space  is  the  set  of  directions 
which  are — roughly  speaking — perpendicular  to  the  surface  near  P. 
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Fig.  2.2 


In  W 7  there  are  various  ways  of  presenting  lines,  planes,  etc.  The  normal  form 
consists  of  a  description  as  the  set  of  points  satisfying  a  set  of  equations  while  the 
parametric  form  is  in  terms  of  independent  variables  and  this,  as  we  shall  see  in 
Chaps.  7  and  10,  is  almost  a  parametrization  of  the  space. 

Example  2.4  Let  S  denote  the  set  of  all  points  in  M3  which  satisfy  the  equation 
x2  +  2y2  —  5 z2  =  1.  We  wish  to  find  the  tangent  space  and  the  normal  line  at  the 
point  (2,  —1,  1)  on  S.  Let  f(x,  y,  z)  =  x2  +  2y2  —  5 z2 .  Then  S  =  /-1(  1).  The 
tangent  plane  at  P  =  (To,  yo,  zo)  in  normal  form  is 

j(x,  y,  Z )  :  V/(x0,  yo,  zo)  •  (x  -  x0,  y  -  yo,  z  -  zo)  =  o| 

=  | (x ,  y,z)  :  (x  -xo)f(P)  +  (y  -  yo)zf(P)  +  (z-  zo)f-(P)  =  ol. 

I  dx  dy  dz  J 

Now  V/(v,  y,  z)  =  (2x,  4 y,  —  10z)  and  V/( 2,  —1,1)  =  (4,  —4,  —10)  and  so  the 
tangent  plane  at  (2,  —1,  1)  is 

{(x,  y,  -)  :  (x  -  2)4  +  (y  +  l)(-4)  +  (z  -  1)(-10)  =  oj 
=  | (x ,  y,  z)  :  2x  -  2y  -  5z  =  l}. 

The  normal  line  at  (vo,  yo^o)  is  the  line  through  (vo,  yo?^o)  in  the  direction  of 
V/ (vo,  yo,  zo)-  In  our  case  we  have,  in  parametric  form,  the  normal  line 

{(2,  -1,  1)  +  t(4,  -4,  -10)  :  t  e  mJ  =  |(2  +  4f,  -1  -  At,  1  -  10?)  :  t  e  rJ. 

To  change  this  into  normal  form  let  v  =  2  +  4t,  y  =  —1—4 1  and  z  =  1  —  10f. 
Solving  for  t  we  get 

x  2  _  y  T  1  _  z  —  1  _ 

4  “  -4  “  -10  “ f 


and  obtain  the  normal  form 


x  —  2 


y  +  i 

-4 


-10 


4 
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To  find  the  tangent  plane  in  parametric  form  we  must  find  two  linearly  independent 
vectors  which  are  perpendicular  to  V/(x o,  yo,  zo)-  Applying  the  observation  that 
(x,  y)  and  (— y,  x)  are  perpendicular  vectors  in  Mr  to  different  pairs  of  coordinates 
in  M3  we  see  easily  that 


and 


(4,  -4, -10).  (4,  4,  0)  =  0 

(4,  -4, -10).  (10,0,  4)  =  0 


and,  moreover,  (4,  4,  0)  and  (10,  0,  4)  are  linearly  independent.  In  parametric  form 
the  tangent  space  at  (2,  —1,  1)  is 

(2,  -1,  1)  +  jx(4,  4,  0)  +  y(10,  0,  4 ):x,ye  m} 

=  1  (2  4"  4x  +  lOj,  —  1  +  4x ,  1  +  4y )  x ,  y  £  IR. 1 . 

Example  2.5  We  wish  to  find  the  normal  plane  and  the  tangent  line  to  the  set  of 
points  satisfying 

x2  +  y2  —  2 z2  =  2  and  xyz  =  2 
at  the  point  (V2,  \/2,  1). 

Let  F(x,  y,  z)  =  (x2  +  y2  —  2 z2,  xyz).  Then  the  set  of  points  which  satisfy  the 
above  equations  form  the  level  set  F_1(2,  2).  We  have 


Ff 


2,  a/2,  l)  = 


2^2  2^2  -4 

V2  V2  2 


The  final  two  columns  form  a  2  x  2  matrix  with  non-zero  determinant  and  hence  F 
has  full  rank  at  (V2,  V2,  1).  The  normal  plane  at  (\/2,  \[2,  1)  has  parametric  form 

(V2,  Vl,  l)  +  lx  (2V2,  2V2,  -4)  +  v  (V2,  V2,  2)  :  x,  y  e  r) 

^V2  +  2\/2x  +  V2y,  \/7.  4"  2\/2x  +  V2y,  1  —  4x  4“  2 y^  x,  y  £  m|. 

To  find  the  normal  plane  in  normal  form  we  must  find  a  non-zero  vector  perpendicular 
to  both  ^2  V2,  2\/2,  —4^  and  ^\/2,  V2,  2^ .  The  cross  product  (see  Chaps.  6  and  7)  of 
the  two  given  vectors  is  of  the  required  type  but  we  take  a  first-principles  approach 
here.  This  amounts  to  finding  (a,  b,  c)  such  that  ^\/2,  V2,  2^  •  (a,  b,  c)  =  0  and 

^2\/2,  2\/2,  —4^  •  (a,  b,  c)  =  0.  We  thus  have  to  solve  the  system  of  equations 
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Via  T-  Vlb  +  2  c  —  0 
Via  T-  Vlb  —  2c  —  0. 

Subtracting  we  see  that  c  =  0  and  we  can  take  a  =  —b  =  1.  Hence  (1,  —1,  0)  is  a 
suitable  vector.  The  normal  plane  (in  normal  form)  is 

{(x,  y,z)  :  (x  ~  V2,  >’  -  V2,  z  -  l)  •  (1,  -1,0)  =  o| 

=  j(x,  y,z)  :  X  -  V2  -  y  +  s/2  =  o| 

=  |(x,  y,z)  :  x  -  y  =  oj. 

The  tangent  line  in  normal  form  is 

j(v,  y,  z)  :  iVlx  +  l\fly  —  4z  =  4,  Vlx  +  V2,y  +  2z  =  bj 
=  |(x,  y,  z)  :  x  +  y  =  2^2,  z  =  lj. 

From  this  we  see  that  the  tangent  line  in  parametric  form  is 

|(r,  2V2  -  t,  1^  :  t  e  rJ. 


Exercises 

2.1.  Let  Fi :  M4  — >  W 


F 1  (xi ,  x2,  x3 ,  x4)  =  x\  -  x|,  Pi  =  (1,2,0,  -1) 

P2(X| ,  X2,  X3,  x4)  =  (Xf  -x|,x|  -X4),  P2  =  (1,0,  2,  -1) 
P3(X1,X2,X3,X4)  =  (xj  -x|,x|  x4 ,  x4  -xp,  P3  =  (1,2,  3,4). 

Calculate  F-(X)  for  X  e  M4  and  find  all  X  such  that  Ff  has  full  rank  at  X. 
When  Fi  has  full  rank  find  all  subsets  of  {x\,  X2,  X3,  X4}  which  can  be  taken 
as  complete  sets  of  independent  variables.  If  Ft  (Pj)  =  C/  and  Ft  has  full  rank 
at  Pi  find  a  function  (pi :  M4_?  — >►  W  such  that  F-_1(Q)  =  graph(0/)  near  Pi. 

2.2.  If  u(x,  y )  and  v(x,  y)  are  defined  by  the  equations  u  cos  v  =  x  and  u  sin  v  =  y 

du  dv 

find  —  and  —  by 

dx  dx 

(i)  finding  explicit  formulae  for  u  and  v 

(ii)  using  implicit  differentiation. 
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2.3.  Let  F(x\,  X2,  X3,  X4)  =  (x\x\,x\X2X2,x2f).  Find  F'{  1,2,  3,4).  Let  A  = 
F'(  1,  2,  3,  4).  Display  the  system  of  equations  AX  =  0.  Solve  this  system 
of  equations  and  find  a  basis  for  the  space  of  solutions.  Using  your  set  of 
solutions  find  the  tangent  space  to  the  level  set  of  F  at  (1,  2,  3,  4). 

2.4.  (a)  Find  in  normal  and  parametric  form  the  normal  line  and  the  tangent  plane 

to  the  surface  z  =  ve-y  at  the  point  (1,0,  1). 

(b)  The  surfaces  x2  +  y2  —  z2  =  1  and  v  +  y  +  z  =  5  intersect  in  a  curve  F. 

Find  the  equation  in  parametric  form  of  the  tangent  line  to  r  at  the  point 

(1,2,2). 

2.5.  Find  the  equation  of  the  plane  passing  through  the  points  (1,2,3)  and  (4,5,6) 
which  is  perpendicular  to  the  plane  lx  +  8y  +  9z  =  10. 

2.6.  Find  the  equation  of  the  tangent  plane  to  *Jx  +  +  y/z  =  4  at  the  point 

(1,4,1). 

2.7.  Find  the  tangent  planes  at  (l/\/2,  1/4,  1/4)  and  (V3/2,  0,  1 /4)  to  the  ellipsoid 
x2  +  4y2  +  4 z2  =  1.  Find  the  line  of  intersection  of  these  two  planes.  Show 
that  this  line  is  tangent  to  the  sphere  x2  +  y2  +  z2  =  k  for  exactly  one  value 
of  k  and  find  this  value. 

2.8.  Find  the  coordinates  of  the  four  points  where  the  hyperbola  v  2  —  y2  =  1  and  the 
ellipse  x2  +  2y2  =  4  intersect.  Sketch  to  scale  both  curves  and  their  tangents 
at  the  points  of  intersection  of  these  tangent  lines.  If  (a,  b),  a  >  b  >  0,  is  one 
of  these  points  show  that  the  tangent  line  to  the  hyperbola  at  ( a ,  b)  coincides 
with  the  normal  line  to  the  ellipse  at  (a ,  b) .  Show  that  the  tangent  lines  to  the 
hyperbola  at  the  four  points  of  intersection  enclose  a  parallelogram  and  find 
its  area. 

2.9.  Find  the  direction  of  the  normal  line  at  the  point  (1,  1,4)  to  the  paraboloid 
z  =  v2  +  y2  +  2.  Find  the  tangent  plane  in  normal  form  at  this  point.  Show  that 
the  normal  line  meets  the  paraboloid  again  at  the  point  (—5/4,  —5/4,  41/8). 
If  6  is  the  angle  between  the  normal  line  through  the  point  (1,  1,4)  and  the 
normal  line  through  the  point  (—5/4,  —5/4,  41  /8)  show  that  sin  0  =  1  /V3. 

2.10.  Consider  the  subset  S  of  M3  which  lies  above  the  (x,  y)-plane  and  which  is 
characterised  by  the  property: 

p  e  S  the  distance  from  p  to  the  vy -plane  is 

the  logarithm  of  its  distance  to  the  z-axis. 

Describe  S  as  a  level  set  and  as  a  graph.  Find  the  normal  line  and  the  tangent 
plane  to  S  at  the  point  (1,  —1,  log  2/2). 

2.11.  Let  S 1  =  {(v,  y,  z)  6  M3  :  y  =  fix)}  denote  a  cylinder  and  let  S2  denote  the 
level  set  z2  +  2zx  +  y  =  0.  If  S\  is  tangent  to  S2  at  all  points  of  contact  find  /. 

2.12.  Let  a  =  (a  1,(22)  and  b  =  (b\,  ^2)  denote  two  non-zero  vectors  in  R2  making 
angles  0\  and  O2,  respectively,  with  the  positive  v-axis.  Show  that  cos^i  = 
01/ 1| a ||  and  sin^i  =  <^2/ l|a|| .  Show  that  62  —  0\  is  the  angle  between  a  and  b 
and,  by  expanding  cos($2  —  0\),  prove  that  cos($2  —  0\)  =  a  •  b/||a||  •  ||b||. 
Prove  the  same  result  for  arbitrary  vectors  in  M/7 . 


Chapter  3 

Lagrange  Multipliers 


Summary  We  develop  the  method  of  Lagrange  multipliers  to  find  the  maximum  and 
minimum  of  a  function  with  constraints. 

We  consider  the  problem  of  finding  the  maximum  and  minimum  of  a  sufficiently 
regular  function  g  of  n  variables  subject  to  the  constraints 


f 1  1 5  •  •  •  5  %n  )  —  £  1 


(3.1) 


To  apply  differential  calculus  we  suppose  there  exists  an  open  subset  U  of  W1  such 
that  each  fi  is  defined  and  differentiable  on  U  and  that  all  points  which  satisfy  (3.1), 
that  we  wish  to  consider,  lie  in  U.  We  also  require  g  to  be  smooth  in  some  fashion 
and  it  is  convenient  to  suppose  now  that  g  is  also  differentiable  on  U. 

If  F  =  (/i,  . . . ,  fm)  then  F:U  C  >  Mm  is  differentiable  and  our  problem 
can  be  restated  as  that  of  finding  the  maximum  and  minimum  of  g  on  the  level  set 
F~{  (C)  where  C  =  (c\ , . . . ,  cm).  To  apply  the  methods  of  the  previous  chapter  we 
suppose  that  F  has  full  rank  on  F~{  (C)  or  equivalently  that  {  V/i  ( X ),  . . . ,  V/m  (X)} 
are  linearly  independent  vectors  for  all  X  in  F_1(C). 

Suppose  g  has  a  local  maximum  on  F~l(C)  at  the  point  P.  Since  we  only  need 
to  examine  g  near  P  we  may  suppose  that  F~l(C)  is  the  graph  of  a  function  0 
of  n  —  m  variables.  By  rearranging  the  variables,  if  necessary,  we  can  assume  that 
0 :  V  C  Mm,  V  open  in  Mn-m,  that  P\  (the  first  n  —  m  coordinates  of  P) 

lies  in  V  and  P  =  (P\,  fi{P\)). 

Let  X\  =  (jci  , . . . ,  xn-m)  e  Wl~m.  Since  g  has  a  local  maximum  on  F~l(C)  at 
P  the  function 


X!  g(x u4>(Xi)) 


(3.2) 
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has  a  local  maximum  on  the  open  set  V  in  Rn~m  at  P\.  Moreover,  since  the  graph 
of  0  coincides  with  the  level  set  F~l  (C)  near  P  we  have  F(X\ ,  0(Xi))  =  C  for  all 
X\  e  V.  In  terms  of  the  coordinate  functions  this  implies 


fi(X\,  =  ct  (3.3) 

for  i  =  1,  . . . ,  m.  To  apply  the  chain  rule  to  the  functions  in  (3.2)  and  (3.3)  we  first 
differentiate  the  function 


Xi  e  V  C  Rn~m - >  (Xi,  4>(Xi))  e  K" 

and  obtain  the  n  x  (n  —  m)  matrix 

/  V(xi)  \  /I  ...  0\ 


m) 

— 

•  • 

•  • 

0  ...  1 

4>'(x  i) 

J 

4>'(x  i) 

/ 

Let  A  denote  the  matrix  in  (3.4)  evaluated  at  X\  =  P\ .  Since  the  first  n  —  m  rows  of 
A  form  a  matrix  with  determinant  1  the  matrix  A  has  maximum  rank  n  —  m.  Hence 
the  space  S  =  {X  e  M/7  :  X  o  A  =  0},  i.e.  the  solution  space  for  the  system  of  n—m 
linear  equations  in  n  variables,  is  an  n  —  (n  —  m)  =  m  dimensional  subspace  of  W1 . 
Applying  the  chain  rule  to  (3.3)  and  letting  X\  =  P\  we  obtain 


Xfi(P)oA  =  0 


for  i  =  1,  ...,m.  Hence  each  Vfi(P)  e  S.  Since  F'(P )  has  maximum  rank, 
{V/i  (P),  ,  V/m (P)}  is  a  set  of  m  linearly  independent  vectors  and  hence  a  basis 

for  S.  Since  the  function  in  (3.2)  has  a  local  maximum  on  the  open  set  V  at  P\ 
one  variable  calculus  shows  that  all  its  partial  derivatives  at  P\  are  zero.  A  further 
application  of  the  chain  rule  implies 

Xg(P)  o  A  =  0. 

Hence  Vg(P)  e  S  and  there  exist  m  scalars,  called  Lagrange  multipliers,  X\,  . . . ,  Xm, 
such  that 

Xg(P)  =  ki  V/i(P)  +  •  •  •  +  Xm  V/m(P).  (3.5) 

Equation  (3.5),  in  coordinates,  is  a  set  of  n  equations,  and  these  together  with  the 
system  of  m  equations  in  (3.1)  gives  n  +  m  equations  in  the  n  +  m  unknowns 
{x\,  . . . ,  xn,  . . . ,  Xm}.  The  method  of  Lagrange  multipliers  consists  in  solving 
these  equations.  The  solutions  are  the  critical  points  of  g  on  F~l  (C)  and  contain  the 
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Fig.  3.1 


local  maxima  and  minima  of  g.  To  determine  if  g  has  a  maximum  and  minimum  on 
F~l  (C)  requires  further  on-the-spot  investigation  as  we  shall  see  in  examples. 

Frequently  we  are  interested  in  finding  the  maximum  and  minimum  on  U  Pi 
F~l(C)  where  U  is  open  in  W1  and  U  is  the  closure  of  U  in  W1.  The  set  U  consists 
of  all  points  which  can  be  reached  from  U,  i.e. 


U  =  {X  e  there  exists  (Xn)n  e  U  with  \\Xn  —  X||  — >►  0  as  n  — >  oo}. 

In  this  case  we  apply  the  method  of  Lagrange  multipliers  to  U  Pi  F~{  (C)  and  exam¬ 
ine  separately  the  values  of  g  on  (U\U)  Pi  F_1(C) — these  points  are  often  easily 
identified  since  they  lie  on  the  boundary  of  U. 

Example  3.1  To  find  the  maximum  and  minimum  of  v  +  y  +  z  subject  to  the  con¬ 
straints  x2  +  y2  =  2  and  v  +  z  =  1.  Let  F  =  (/i,  fi)  where  f\(x,  y,  z)  =  x2  +  y2 
and  /2(x,  y,  z)  =  v+z  andletg(v,  y,  z)  =  v  +  y +  z.  We  wish  to  find  the  maximum 
and  minimum  of  g  on  the  set  F~l(2,  1).  We  have 


F'(x,  y,  z) 


2x  2  y  0  \ 

1  0  l)  • 


On  F_1( 2,  1)  we  have  x2  -\-  y2  =2  and  so  v  and  y  cannot  both  be  zero  at  the 
same  time.  Thus  the  rows  of  F'(x,  y,  z)  are  linearly  independent  when  (x,  y,  z)  E 
F_1( 2,  1).  Hence  V/i  and  V/2  are  linearly  independent  on  F~{( 2,  1)  and  we  may 
apply  the  method  of  Lagrange  multipliers.  The  level  set 

F~l(2,  1)  =  { (x :  y,  z)  :  x2  +y2  =  2}  D  {(x,  y,z)  :x  +  z  =  1} 


is  the  intersection  of  a  cylinder  parallel  to  the  z-axis  and  a  plane  (why?)  (see  Fig.  3.1). 
This  gives  us  a  closed  ellipse  and  compact  set  in  R3  .  By  the  fundamental  existence 
theorem  g  has  a  maximum  and  a  minimum  on  F~l( 2,  1).  This  apparently  rather 
theoretical  information  has  practical  implications  since  we  now  know,  before  doing 
any  calculations,  that  the  method  of  Lagrange  multipliers  will  yield  at  least  two 
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solutions  and  the  maximum  (minimum)  solution  will  be  the  maximum  (minimum) 
of  g  on  F~ 1  (2,  1 ) .  By  the  method  of  Lagrange  multipliers  there  exist,  at  local  maxima 
and  minima,  scalars  X  \  and  X2  such  that 

Vg(x,  y,z )  =  A-i Vfi(x,  y,z )  +  A2V/2(x,  y,  z) 


i.e. 

(l,l,l)=X1(2x,2y,0)+X2(l,0,  1). 


In  terms  of  coordinates  this  is  equivalent  to  the  following  system  of  equations 


1  —  2X\x  H-  X2 
1  =  2Xi  y 
1  =  A2 


1  —  2X\x  H-  1  y  X\x  —  0. 


Hence  X\  =  0  or  x  =  0.  However,  the  second  equation  implies  1  =  2X\y  so 
X\  7^  0.  If  v  =  0  then  f\(x,  y,  z)  =  y2  =  2  and  y  =  =b\/2,  while  f2(x,  y,  z)  = 
x  +  z  =  1  implies  z  —  1.  Our  only  solutions  are  (0,  \/2,  1)  and  (0,  —  V2,  1).  Since 
g(0,  V2,  1)  =  V2  +  1  and  g(0,  —\[2,  1)  =  —\f2  +  1  it  follows  that  \[2  +  1  and 
— V2  +  1  are  the  maximum  and  minimum  of  g  on  F  1  (2,  1). 

Of  course,  substituting  the  constraint  v  +  z  =  1  into  g  =  0  implies  g(x,  y,  z)  = 
g(x,  y,  1  —  x)  =  1  +  y  and  our  problem  reduces  to  finding  the  maximum  and 
minimum  of  1  +  y  on  the  set  x2  +  y2  =  2.  On  this  set  —\pl  <  y  <  \pl  and  hence  the 
maximum  is  1  +  y/2  and  the  minimum  1  —  \f2.  This  verifies  the  solution  obtained 
using  Lagrange  multipliers  and  also  reminds  us  that  it  is  always  worthwhile  thinking 
about  a  problem  before  attempting  any  solution. 


Example  3.2  The  geometric  mean  of  n  positive  numbers  x\,  , 

X\  T  x2  T  ■  ■  ■ 

is  always  less  than  or  equal  to  the  arithmetic  mean - 


Xn,  (X\  ■  ■  -Xn)l/n 

+  Xn 


n 

In  this  example  (see  also  Example  3.4)  we  standardise  one  of  the  quantities  and 
generate  a  Lagrange  multiplier  type  problem.  Let  g(x\,  . . . ,  xn)  =  x\  •  •  •  xn  and 

f{x  1, ...  ,xn)  =  x\  +  x2  H - 1-  xn .  We  begin  by  finding  the  maximum  of  g  on  the 

set 


/  x(l)  fl  {(xi,  . . . ,  xn)  :  Xi  >  0  for  all/}. 


It  is  easily  checked  that  this  is  a  compact  set  and,  by  the  fundamental  existence 
theorem,  the  maximum  exists.  On  the  set  /-1  (1)  H  {(vi ,  . . . ,  xn)  :  Xi  >  0}  we  may 
apply  the  method  of  Lagrange  multipliers.  We  have 


C^l  5  •  •  •  ?  Xn )  —  (X2  ’  ’  ’  Xn ,  X 1 V3  •  •  •  Xn ,  .  .  .  ,  X 1 X2  '  '  '  Xn — 1 ) 


1  1 


=  X\ 


X 


n 


X\  x2 


and 

V/(v i,  . . .  ,xn)  =  (1,  1,  . . . ,  1). 
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X 

Fig.  3.2 


If  Vg(xi ,  . . . ,  xn)  =  AV/(v i,  . . . ,  xn)  then  x\  •  •  •  xn/xi  =  A  and  Xi  =  x\  •  •  •  xn/X 
for  all  /.  This  shows  m  =  V2  =  •  •  •  =  xn.  Since  x\  +  X2  +  •  •  •  +  xn  =  1  we 
have  Xi  =  \/n  for  all  i  and  g(l/n,  1  /n,  . . . ,  l/n)  =  n~n .  As  g(vi,  . . . ,  xn)  =  0 
whenever  one  of  the  x\  s  is  equal  to  zero  it  follows  that  the  maximum  of  g,  on  the 
set  f{x i, . . . ,  xn)  =  1  and  Xi  >  0  all  i,  is  (1  /n)n. 

If  xi ,  i  =  l, ...  ,n,  are  arbitrary  positive  numbers  let  yt  =  Xi  j  Xj=i  xj  f°r  eac^ 
i .  We  have 

n  n  n 

X  =  Zx  /  =  r 

/  — 1  /  =  1  7  =  1 


and,  by  the  first  part, 


Hence 


and,  as  required, 


(x\  •  •  •  Xn)^^n  <  —  (x\  +  X2  +  •  •  •  +  xn). 

n 


Notice  that  although  the  number  of  variables,  n,  could  be  large  symmetry  led  to  rather 
simple  equations.  In  mathematics  symmetry  compensates  for  size. 


Example  3.3  To  find  the  dimensions  of  the  box  (Fig.  3.2)  of  maximum  volume  V 
given  that  the  surface  area  S  is  10m2. 

We  wish  to  maximise  V(x,  y,  z)  =  xyz  subject  to  the  constraint  S(x,  y,  z)  = 
2 (xy  +  yz  +  xz)  =  10.  Physical  constraints  imply  that  v  >  0,  y  >  0  and  z  >  0.  The 
set  U  =  {(v,  y,  z)  :  x  >  0,  y  >  0,  z  >  0}  is  open  and  our  problem  is  to  determine 
the  maximum  of  V  on  U  HS.  The  method  of  Lagrange  multipliers  will  identify  a  set 
which  contains  all  local  maxima  and  minima  in  U  H  S.  We  proceed  to  do  this  now. 
We  have 
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W (v,y,z)  =  (yz,  xz,  xy) 

and 

VS(x,  y,  z)  =  (2 (y  +  z),  2(x  +  z),  2(x  +  y)). 

Since  x,  y  and  z  are  all  positive  on  U  we  have  W(x,  y,  z)  7^  (0,  0,  0).  Applying 
the  method  of  Lagrange  multipliers  we  consider 

W  =  XWS 


and  obtain  the  following  system  of  equations 

yz  =  2 X(y  +  z)  =>  xyz  =  2X(xy  +  xz) 
xz  =  2X(x  +  z)  =>•  xyz  =  2X(xy  +  yz) 
xy  =  2X(x  +  y)  ==>►  xyz  =  2X(xz  +  yz). 

Hence  A  ^  0  and,  on  dividing  by  2X,  we  get  xy  +  xz  =  xy  +  yz  =  xz  +  yz  and 
afterwards  xy  =  yz  =  zx.  Since  1  ^  0,  y  /  0  and  z  ^  0  this  implies  x  =  y  =  z 
and  S(x,y,  z)  =  10  now  shows  that  6x2  =  10.  Hence  x  =  y  =  z  =  (5 /3) 1//2 .  Let 

p  =  t(5/3)'/2,  (5/3)1/2,  (5/3)1/2).  We  have  V(P)  =  (5/3)3/2. 

We  now  wish  to  show  that  V  takes  its  maximum  value,  subject  to  the  constraint 
S  =  10,  at  P.  The  set  U  H  S-1  (10)  is  closed  but  not  bounded  and  so  the  fundamental 
existence  theorem  for  maxima  and  minima  cannot  be  applied  directly.  In  such  cases 
ad  hoc  methods  are  necessary  and  the  following  approach  is  sometimes  useful.  For 
r  >  0  let 

Ur  —  {(v,  y,  z)  :  0  <  v  <  r,  0  <  y  <  r,  0  <  z  <  r}. 

The  set  Ur  is  compact  (i.e.  closed  and  bounded)  and  as  r  increases  to  infinity  Ur 
expands,  inside  U,  and  in  the  limit,  i.e.  at  infinity,  it  covers  U.  If  V  does  have 
a  maximum  on  U  H  S-1(10)  then  it  must  lie  in  some  Ur.  Since  V(P)  >  1  our 
strategy  is  to  show  that  for  r  sufficiently  large  we  have  V(x,y,z)  <  1  whenever 
(x,  y,  z)  ^  Ur.  This  will  imply  that  the  maximum  of  V  on  Ur,  which  exists  by  the 
fundamental  existence  theorem,  equals  the  maximum  of  V  on  U.  If  (x,  y,  z)  ^  Ur 
then  one  of  x,  y,  z,  say  x,  is  greater  than  r.  If  S(x,  y,  z)  =  10  then  2xy  <  10  and 
2xz  <  10.  Hence  y  <  10/2v  =  5/x  and  z  <  10 /2x  =  5/x  <  5/r  and 

5  5  25 

V(x,  y,  z)  =  xyz  <  x  •  —  •  -  =  — .  (3.6) 

x  r  r 

In  particular,  if  r  =  25  then  V(v,y,z)  <  1  for  all  (x,  y,  z)  ^  U25  satisfying 
S(x ,  y,  z)  =  10.  Since  P  e  U25  and  V(P)  >  1  this  implies 
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maximumj  V(x,  y,  z)  :  (x,  y,  z)  e  U  n  S  1  (10)} 

=  maximumj  V  (x,  y,  z)  :  (x,  y,  z )  G  U25  H  S  1(10)}_ 


Since  U25  Pi  -S'- 1  (10)  is  compact  and  V  is  continuous  the  fundamental  existence  the¬ 
orem  implies  that  V  has  a  maximum  on  U25  El  S_1  (10).  The  maximum  occurs  either 
inside  or  on  the  boundary.  If  it  occurs  inside  then  our  use  of  Lagrange  multipliers 
implies  that  it  occurs  at  the  point  P.  On  the  boundary  of  U25  we  have  either  at  least 
one  coordinate  zero,  in  which  case  V  =  0,  or  at  least  one  coordinate  equal  to  25  in 
which  case  (3.6)  implies  V  <  1  and  we  conclude  that  the  maximum  cannot  occur 
on  the  boundary.  We  have  thus  shown  that  the  absolute  maximum  of  V  occurs  at  the 
point  P  =  ((5/3)1//2,  (5/3)1//2,  (5/3)1/2)  and  equals  (5/3)3//2. 

Example  3.4  In  this  example  we  prove  the  Cauchy-Schwarz  inequality 


n 


i= 1 


where  at  and  bi  are  arbitrary  real  numbers.  This  is  one  of  the  best  known  and 
most  widely  used  inequalities  in  mathematics.  If  we  use  vector  notation,  i.e.  let 
a  =  (a\,  ...  ,an)  and  b  =  (b\,  . . . ,  bn),  and  take  square  roots  we  can  rewrite  the 
inequality  as 


(a,  b) 


(3.7) 


The  advantages  of  vector  notation  in  this  proof  are  obvious.  Let  g :  M2/7  — >  R  be 
defined  by  g(  a,  b)  =  (a,  b) .  Motivated  by  Example  3.2  and  the  form  of  the  inequality 
given  in  (3.7)  we  first  find  the  maximum  and  minimum  of  g  on  the  set  || a || 2  = 
||b||2  =  1. 

Let  /i:R2w  — >  R  and  /2:M2,?  R  be  defined  by 


n 

/i(a,  b)  =  ||a||2  =  (a,  a)  =  ^ of 

i  =  1 


and 

n 

fi( a,  b)  =  ||b||2  =  (b,b>  =  ^&2. 

i=\ 


We  wish  to  maximise  g (a,  b)  =  ai^i  —  (a>  on  the  set  S  =  {(a,  b)  :  ||a||2  = 
|| b || 2  =  l}.  The  set  S  is  easily  seen  to  be  compact  and  hence  g  has  a  maximum  and 
minimum  on  S.  We  have 


V/i(a,  b)  =  2(a\, . . . ,  an,  0, . . . ,  0)  =  2(a,  0) 
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and 

V/2( a,  b)  =  2(0, . . . ,  0,  b\,  . . . ,  bn)  =  2(0,  b) 
where  0  denotes  the  origin  in  M77 .  An  inspection  of  the  matrix 

/VA(a,b)\  =  /2a  0  \ 

\V/2(a,  b)/  V°  2bJ 

shows  immediately  that  V/i(a,  b)  and  V/2( a,  b)  are  linearly  independent  whenever 
a/0  and  b  ^  0  and  implies  that  we  may  apply  the  method  of  Lagrange  multipliers. 
From  Vg  =  AiV/i  +  A2V/2  and 

Vg(  a,  b)  =  (b\,  ...,bn,ai,  . . . ,  an)  =  (b,  a) 


we  see  that 

(b,  a)  =  Ai(2a,  0)  +  A2(0,  2b)  =  (2Aia,  2A2b). 

Hence  b  =  2Aia  and  since  ||a||  =  ||b||  =  1  this  implies  2X\  =  ±1  and  b  =  ±a.  If 
b  =  a  then  g( a,  a)  =  af  —  1>  while  if  b  =  —a  then  g( a,  —a)  =  — 

=  —  1.  Hence,  if  ||a||  =  ||b||  =  1,  then 

-1  <£(a,  b)  <  1 


and 


^(a,  b) 


<  1. 


If  a  7^  0  and  b  ^  0  then 


and 


g(a,b) 

,  „2„ki2  <  1 

a2  br 


^(a,  b) 


This  is  trivially  verified  if  either  a  or  b  is  0.  Since  g(a,  b)  =  (a,  b),  we  have  proved 
the  Cauchy-Schwarz  inequality.  The  above  also  shows  that  we  have  equality  in  the 
Cauchy-Schwarz  inequality  if  and  only  if  a  and  b  are  parallel  vectors.  In  particular, 
for  unit  vectors  a  and  b,  we  have  (a,  b)  =  1  if  and  only  if  a  =  b  (see  Example  8.5). 


Exercises 

3.1.  Find  the  maximum  and  minimum  of  vy  +  yz  on  the  set  of  points  which  satisfy 
x2  +  y2  =  1  and  yz  =  v. 


3  Lagrange  Multipliers 


33 


3.2.  Find  the  highest  and  lowest  points  on  the  ellipse  of  intersection  of  the  cylinder 
x2  +  y2  =  1  and  the  plane  v  +  y  +  z  =  1. 

3.3.  Suppose  a  pentagon  is  composed  of  a  rectangle  surmounted  by  an  isosceles 
triangle.  If  the  length  of  the  outer  perimeter  P  is  fixed  find  the  maximum 
possible  area. 


(Hint:  you  must  maximise  ab-\-\bd  subject  to  the  constraints  b  +  2a  +  2c  =  P 
and  (b/ 2) 2  +  d2  =  c2 .) 

3 .4.  Find  the  minimum  value  of  x2  +  y2  +  z2  subject  to  the  constraints  x  +  y  —  z  =  0 
and  x  +  3y  +  z  =  2.  Give  a  geometrical  interpretation  of  your  answer  and 
using  your  interpretation  explain  why  x2  +  y2  +  z2  has  no  maximum  subject 
to  these  constraints. 

3.5.  Explain  why  the  function  f(x,y,z)  =  x2  +  y2  +  2 z2  has  a  minimum  on  the 
surface  S  defined  by  the  equations  x  +  y  +  z  =  3  and  v  —  y  +  3z  =  2.Use 
Lagrange  multipliers  with  two  constraints  to  find  the  minimum  value.  Using  the 
first  constraint  find  0  such  that  z  =  0  (x ,  y )  on  S.  Now  use  Lagrange  multipliers 
to  find  the  minimum  of  /  (x ,  y ,  0  (x ,  y ))  on  the  level  set  x  —  y  +  30  (x ,  y )  =  2. 
Using  both  constraints  express  y  and  z  as  functions  0i  and  02  of  x  for  all  points 
on  S.  Find  the  minimum  of  f(x,  0i(v),  0200)  using  one- variable  calculus. 

3.6.  Show  that  the  maximum  and  minimum  of  f(x,  y,  z)  =  x /a  +  y/b  +  z/c  on 
the  ellipsoid  ( x/a )2  +  (y/b)2  +  (z/c)2  =  1  are  \/3  and  —  \/3  respectively 
where  a ,  Z?  and  c  are  positive  constants. 

3.7.  Find  the  minimum  value  of  vyz  on  the  level  set  F-1(l),  v  >  0,  y  >  0,  z  >  0, 
where 

1  1  1 

F(x,  y,z)=  \ - F  -  • 

x  y  z 

3.8.  Use  Lagrange  multipliers  to  find  the  maximum  volume  of  the  rectangular  solid 
in  the  first  octant  (v>0,y>0,z>0)  with  one  vertex  at  the  origin  and  the 
opposite  vertex  lying  in  the  plan o  x/a  +  y /b  +  z/c  =  1  where  a ,  b  and  c  are 
positive  constants. 

3.9.  Show  that  of  all  triangles  inscribed  in  a  fixed  circle  the  equilateral  triangle 
maximises;  (a)  the  product  of  the  lengths  of  the  sides,  (b)  the  sum  of  the 
squares  of  the  lengths  of  the  sides. 

3.10.  Find  the  minimum  of  f(x,  y)  =  2y  on  the  set  3x2  —  y5  =  0. 
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3.11.  If  /:  t/(open)  c  W1  — >  R,  P  e  £/  and  V/(P)  7^  0  find  the  direction,  at  P, 
in  which  /  increases  most  rapidly. 


3.12.  Show  that  the  maximum  of  x\  •  •  •  xn 


n\ 

nn/2' 


3.13.  Within  a  triangle  there  is  a  point  P  such  that  the  sum  of  the  squares  of  the 
distances  to  the  sides  is  a  minimum.  Find  this  minimum  in  terms  of  the  lengths 
of  the  sides  and  the  area. 

3.14.  Find  the  distance  from  the  point  (10,  1,  —6)  to  the  intersection  of  the  planes 


x  +  y  +  2z  =  5 
2x  —  3y  +  z  =  12. 


3.15.  What  is  the  relationship  between  Exercise  2.12  and  the  Cauchy-Schwarz 
inequality  (Example  3.4)? 

3.16.  If  a  and  b  are  positive  numbers  find  the  maximum  and  minimum  values  of 
(xv  —  yu)2  subject  to  the  constraints  x2  +  y2  =  a 2  and  u2  +  v2  =  b2 . 


Chapter  4 

Maxima  and  Minima  on  Open  Sets 


Summary  We  derive,  using  critical  points  and  the  Hessian,  a  method  of  locating 
local  maxima,  local  minima  and  saddle  points  of  a  real-valued  function  defined  on 
an  open  subset  of  Wl . 


We  turn  to  the  problem  of  finding  local  maxima  and  local  minima  of  a  real-valued 
function  /  defined  on  an  open  subset  U  of  W: .  The  set  of  critical  points  of  /  on 
U,  {X;  V/(X)  =  0},  will  include  all  points  where  /  achieves  a  local  maximum  or 
minimum  but  may  contain  additional  points  such  as  saddle  points.  A  critical  point 
P  is  a  saddle  point  if  /,  restricted  to  some  curve  passing  through  P,  has  a  local 
maximum  at  P,  while  it  has  a  local  minimum  at  P  along  some  other  curve  passing 
through  P. 

The  Hessian  of  /  at  P  e  U ,  Hf(p),  is  defined  as  the  n  x  n  matrix 


( 


d2/ 

3  Xi  dx 


( P ) 


) 


.  To  define  the  Hessian1  we  are,  of  course,  assuming  that  all 


1  J  /  1  <i,j<n 

first-  and  second-order  partial  derivatives  of  /  exist.  We  use  the  convention  that  the 
order  of  differentiation  is  from  right  to  left,  i.e. 


92/ 

dxi  dxj 


d 

dxi 


If  all  these  second-order  partial  derivatives  exist  and  are  continuous  then  the  order 
of  differentiation  is  immaterial  and 


82/ 

dxi  dxj 


92/ 

dx  j  dxi 


for  all  i  and  j.  In  this  case  Hf(p)  is  a  symmetric  n  x  n  matrix.  We  will  not  prove  this 
result  but  provide,  in  Exercise  4.6,  a  practical  method  which  proves  that  all  functions 
you  will  probably  ever  encounter  have  this  property.  If  v  =  (vi,  . . . ,  vn)  is  a  row 


1  The  notation  V2(/)  is  also  used  for  the  Hessian  of  /. 
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vector  and  V  is  the  corresponding  column  vector  then 


9/ 

9v 


c?)  =  Z 

i  =  l 


Vi 


and 


9V 

9v2 


=  ▼»/(/•)  v. 


The  main  theoretical  result  on  the  existence  of  local  maxima,  local  minima  and  saddle 
points  is  the  following  theorem. 

Theorem  4.1  If  U  is  an  open  subset  of  M",  /:  U  — >  M  w  a  Ov/ce  continuously 
differentiable  function  on  U  and  P  is  a  critical  point  of  f,  i.e. 


V/(P)  = 


( 


df  3/ 

J  =0 


3v 


■) 


then 


(1)  /  /zas  a  strict  local  maximum  at  P  if 

(2)  /  has  a  strict  local  minimum  at  P  if 


.  92/ 


3v2 

.,9V 

3v2 


(P)  <  0  for  all  v  ^  0, 
>  0  for  all  v  ^  0, 


(3)  /  /zas  a  saddle  point  at  P  if  there  exist  v  and  w  swc/z  that 

d2f  d2f 

^(p)  <°<  ^(p)- 

rZ  3  wz 


dv J 


To  derive  a  practical  test  from  this  result  we  use  linear  algebra  and  Lagrange  mul¬ 
tipliers.  To  simplify  matters  we  change  our  notation  and  let  v  =  X  =  (x\,  . . . ,  xn), 

a 2/ 

(P)  and  A  =  ( Pij)l<i,j<n •  With  this  notation 


aij  — 


dxi  dxj 


9V 

9v2 


vHf(P)‘\  =  XA'X. 
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Since 


for  X  7^  0  we  have 

XA'X  >  0  for  all  X  ±  0  min  X  A‘X  >  0 

II  jr  H=i 

XA'X  <  0  for  all  X  ^  0  max  X A'X  <  0 

11X11  =  1 


there  exists  X,  Y  e  M/7  such  that 

XArX  <  0  <  Y A*  Y  min  XAfX  <  0  <  max  XAfX. 

||X||  =  1  ||  JX  ||  =  1 

We  thus  need  to  examine  the  extreme  values  of  XAfX  on  the  set 

n 

||X||2  =  (X,  X)  =  ^>2  =  1. 

i  =  1 


Let 

n 

h(X)  =  h(x\, ... ,  xn)  =  ciijXiXj  =  XA*X 

i,j= i 

and 

n 

g(X)=g(x  i,...,xn)=  S3  X?  =  (X,  X). 

ij  =  1 

Since  the  set  g~l  (1)  is  compact  and  h  is  continuous  the  fundamental  existence  the¬ 
orem  implies  that  h  has  a  maximum  and  minimum  on  g~l  (1).  Using  the  coordinate 
expansion  of  g  we  see  that  Vg(X)  =  (2xi, . . . ,  2xn)  =  2X  and  Vg(X)  ^  0  on 
g_1(l).  Hence  we  may  apply  the  method  of  Lagrange  multipliers  to  find  the  maxi¬ 
mum  and  the  minimum  of  h  on  the  set  g_1  (1).  We  have 


Vh(X)  = 


=  2XA. 


By  the  method  of  Lagrange  multipliers  there  exists  at  the  maximum  and  minimum 
points  of  h  on  g_1  (1)  a  real  number  A  such  that 


2XA  =  2\X. 
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Taking  the  transpose  we  get 

ArX  =  X*X  =  XIfX 


i.e. 

(A-A/)rX  =  0.  (4.1) 

Since  ||X||  =  1,  any  A  which  satisfies  (4.1)  is  an  eigenvalue2  of  A  and,  moreover, 


h(X)  =  XA'X  =  XX*X  =  X(X,  X)  =  X . 


Thus  the  maximum  and  minimum  values  of  h  are  eigenvalues  of  A  and  are  achieved 
at  the  corresponding  unit  eigenvectors.  If  all  eigenvalues  are  positive  then  h  is  always 
positive  and  /  has  a  local  minimum  at  P,  if  all  eigenvalues  are  negative  then  h  is 
always  negative  and  /  has  local  maximum  at  P  and  if  some  are  positive  and  some 
negative  then  h  takes  positive  and  negative  values  and  /  has  a  saddle  point  at  P .  If 
A  is  an  eigenvalue  of  A  the  set 


Ex  \=  {X  e  :  ArX  =  XrX) 


is  a  subspace  of  W\  called  the  X-eigenspace  of  A,  and  the  dimension  of  Ex  is  called 
the  multiplicity  of  the  eigenvalue  A.  An  n  x  n  symmetric  matrix  has  n  eigenvalues 
when  eigenvalues  are  counted  according  to  multiplicity,  i.e.  if  Ex  is  j -dimensional 
then  A  is  counted  j  times,  and  det(A)  =  X\  •  •  •  Xn .  Since  a  particular  case  of  the  above 
result  will  play  an  important  role  in  our  study  of  Gaussian  curvature  in  Chap.  16  we 
display  it  separately. 

Proposition  4.2  If  A  =  (0*7)  1<?-  -<2  is  a  symmetric  2x2  matrix  with  eigenvalues 
X\  and  X2,  X\  >  A2,  then 


max.janv  +  la\2xy  +  <222^  -x  +  y  =  1}  =  Ai 

and 

min.{anv  +  2auxy  +  a22y  -x  +y  =  1}  =  A2. 

Moreover,  the  maximum  and  minimum  are  achieved  at  eigenvectors  of  A  and 

det(A)  =  A1A2. 

For  our  test  we  just  need  to  know  the  sign  of  the  largest  and  smallest  eigenvalues. 
Since  eigenvalues  may  be  difficult  to  calculate  we  will  use  a  reasonably  well-known 
result  from  linear  algebra.  If  A  is  a  square  matrix  then  the  k  x  k  matrix  Ax  obtained 
by  deleting  all  except  the  first  k  rows  and  k  columns  of  A  is  called  the  k  x  k  principal 
minor  of  A.  We  have  An  =  A. 


2  We  could  refocus  this  analysis  to  show  that  any  symmetric  n  x  n  matrix  with  real  entries  admits 
eigenvalues. 


4  Maxima  and  Minima  on  Open  Sets 


39 


Ai 


\ 


J 


We  require  the  following  result: 

if  A  is  a  symmetric  n  x  n  matrix  then  all  eigenvalues  of  A  are  positive  if  and  only 
ifdet(Ak)  >  0  for  all  k. 

Proposition  4.3  Iff:  U  C  Wl  —>  R,  U  open,  has  continuous  first-  and  second- 
order  partial  derivatives,  P  is  a  critical  point  of  f,  and  A  =  Hf^p )  is  the  Hessian 
of  f  at  P  then  the  following  hold: 

(a)  ifdet(A2k)  <  0  for  some  k  then  P  is  a  saddle  point  of  f, 

(b)  ifdet(An)  0  then 

(bl)  if  det(A^)  >  0  for  all  k  then  f  has  a  strict  local  minimum  at  P, 

(b2)  if  (— 1)*  det(AO  >  0  for  all  k  then  f  has  a  strict  local  maximum  at  P, 

(b3)  if  the  conditions  on  the  determinants  in  (bl)  and  (b2)  do  not  apply  then  f 
has  a  saddle  point  at  P. 

Proof  (a)  Suppose  det(A2£)  <  0  for  some  positive  integer  k,  2k  <  n.  If  P  = 
(p i,  . . . ,  pn)  let  Q  =  (p i,  . . . ,  p2k)-  Consider  the  function  g :  V  C  R2k  ->  R 
defined  by 

g  (v  1 ,  .  .  .  ,  X2 k )  —  f  (A  1  ?  •  •  •  j  k  •>  P2k+  !■>  •  •  •  ?  Pn  ) 
where  V  is  the  open  set  in  R2k  consisting  of  all  (x\ ,  . . . ,  X2k)  such  that 

(jTl,  .  .  .  ,  X2 k->  P2k+h  •  •  •  ?  Pn)  ^  U. 


It  is  easily  checked  that 


Let  fii,  . . . ,  P2 k  denote  the  2k  eigenvalues  of  the  symmetric  2k  x  2k  matrix  A 2k 
counted  according  to  multiplicity.  We  have 


P\  P2k  =  det (A2jfe)  <  0. 
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Since  2k  is  an  even  integer  it  follows  that  A 2k  has  positive  and  negative  eigenvalues. 
Hence  there  exist  u  =  («i,  . . . ,  U2k )  and  v  =  (vi,  . . . ,  V2k)  such  that 

uA2k\y  <  0  <  vA2*V. 


Let  wi  =  (mi,  . . . ,  U2k,  0, . . . ,  0)  and  W2  =  (vi, . . . ,  V2k,  0, . . . ,  0).  Then 


d2/ 

3wi2 


(P)  =  w |  Hfip/Wi 


uA2k‘u  <  0  <  \A2kty 

d2f 


=  W2  Hf(p)  W2  = 


9w2: 


(/>) 


and  /  has  a  saddle  point  at  P.  This  proves  (a). 

(b)  The  first  part,  (bl),  follows  directly  from  the  linear  algebra  result  quoted  above. 
Since  /  has  a  local  maximum  at  P  if  and  only  if  (— /)  has  a  local  minimum  at  the 
same  point  and  H-f(p)  =  — Hf(p )  it  follows,  from  (bl),  that  /  has  a  local  maximum 
at  P  if  det(— Ak)  >  0  for  all  k.  Hence  (b2)  follows  since  det(— A^)  =  (— 1)^  det(A^). 
If  (bl)  and  (b2)  do  not  apply  then,  since  all  eigenvalues  of  A  are  non-zero,  A  has 
positive  and  negative  eigenvalues  and  hence  /  has  a  saddle  point  at  P.  This  completes 
the  proof.  □ 

If  det (Hf(p))  =  0  we  call  P  a  degenerate  critical  point  of  /  (all  other  critical 
points  are  called  non-degenerate)  and  higher  order  derivatives  may  be  required  to 
test  the  nature  of  the  critical  point. 

Proposition  4.3  enables  us  to  classify  all  non-degenerate  and  some  degenerate 
critical  points.  When  P  is  a  non- degenerate  critical  point  of  /  and  /  has  a  local 
maximum  or  minimum  at  P  then  the  determinants  of  the  odd  principal  minors  all 
have  the  same  sign.  This  leads  to  the  second  test  for  saddle  points  given  below  and 
a  little  reflection  shows  that  it  may  be  applied  at  any  critical  point.  In  practice  the 
determinants,  det  (A/),  are  calculated  in  the  order  i  =  1,  2,  3, . . .  and  testing  for 
saddle  points  is  carried  out  as  the  calculations  proceed.  The  critical  point  is  a  saddle 
point  and  the  calculations  stop  when  for  the  first  time  either  of  the  following  is 
observed: 

det(A2;0  <  0 

det(A2fc-i)  det(A2&+i)  <  0. 

If  (bl)  or  (b2)  are  used  to  find  a  local  maximum  or  minimum  then  all  determinants 
of  the  Hessian  must  be  calculated. 
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Example  4.4  Let  f(x,y,z)  =  x2y2  +  z2  +  2x  —  4y  +  z.  We  have 

V f(x,  y,  z)  =  ( 2xy 2  +  2,  2x2y  -4,2z  +  1). 

If  P  is  a  critical  point  of  /  then 


2  xy2  +  2  =  0 
2 x2y  -4  =  0 
2z  +  1  =  0 

Hence  z  =  —1/2  from  the  third  equation.  From  the  first  two  equations  we  see  that  v 
and  y  are  non-zero.  Hence  xy2  =  —  1  and  x2y  =  2  imply  xy2/x2y  =  —1/2  =  y/x 
andv  =  — 2y.  We  have  — 2y-y2  —  —  l,i.e.  y3  =  1/2  and  y  —  2-1/3.Fromv  =  —2 y 
we  obtain  v  =  — 22^3  and  conclude  that  (— 22//3,  2-1/3,  —1/2)  is  the  only  critical 
point  of  /.  A  simple  calculation  shows  that 

(2y2  4 xy  0\ 

Hf(x,y,z )  —  I  4vy  2x2  0  I 

\  0  0  2/ 

and  hence 

/  21/3  —4  •  21/3  0\ 

%-2^/3, 2-1/3, 1/2)  =  (  -4  •  21/3  2  •  24/3  0  . 

V  0  02/ 

Since  det(21/3)  >  0  and 

-U'S/3  — 

=  4  •  22/3  -  16  •  22/3  <  0 

the  critical  point  (— 22/3,  2-1/3,  —1/2)  is  a  saddle  point  of  /. 

Example  4.5  We  wish  to  find  and  classify  the  non-degenerate  critical  points  of 
fix,  y,  z)  =  x2y  +  y2z  +  z2  —  2x.  We  have 

Vf(x,  y,  z )  =  (2xy  -  2,  x2  +  2yz,  y2  +  2 z) 

and  the  critical  points  satisfy  the  equations 

2xy  —  2  =  0,  x2  +  2yz  =  0  and  y2  +  2z  =  0. 

Substituting  z  =  —  y2/2  into  the  second  equation  implies  y3  =  x2.  Hence,  the  first 
equation  shows  y5/2  =  1  and  we  have  y  =  1  and  z  =  —1/2.  From  xy  =  -1  we  get 
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x  =  1  and  (1,1,  —1/2)  is  the  only  critical  point  of  /.  We  have 

Hf(x,y,z)  =  / 

and 

Hf  (1,1,  — 1/2)  = 

Since  det(2)  >  0  and 

"(2  A) 

the  point  (1 ,  1 ,  —  1  /2)  is  a  saddle  point  of  /. 

A  number  of  simple  initial  checks  may  be  carried  out  to  detect  saddle  points.  These 
are  based  on  rearranging  the  variables,  a  procedure  which  does  not  alter  the  nature 
of  a  critical  point.  Thus  we  may  first  interchange  row  i  and  row  j  and  afterwards 
column  i  and  column  j  in  the  Hessian  and  then  apply  our  standard  test.  In  this  way 
any  2x2  sub-matrix  of  the  Hessian,  which  is  symmetric  about  the  diagonal,  can 
be  moved  to  become  the  2x2  principal  minor.  Since  a  2  x  2  symmetric  matrix 
with  positive  and  negative  diagonal  entries  has  negative  determinant  this  means  that 
a  critical  point  is  a  saddle  point  if  the  diagonal  contains  positive  and  negative  terms. 
This  applies,  for  instance,  to  Example  4.5.  Another  useful  indicator  is  the  presence 
of  a  relatively  large  non-diagonal  term;  if  ciij ,  the  i  j th  term  in  the  Hessian,  satisfies 
| dij |  >  \akk\  for  all  k  then,  as  we  see  in  our  next  example,  this  also  implies  that  the 
critical  point  is  a  saddle  point. 

It  is  not  necessary  to  physically  interchange  these  rows  and  columns  as  the  appro¬ 
priate  sub-matrix  can  be  isolated  by  inspection  but  to  convince  you  that  this  process 
is  valid  we  include  this  step  in  our  next  example. 

Example  4.6  Suppose  the  Hessian  at  a  critical  point  P  is 

2  9  1  \ 

3  2  0 

2  4  2' 

0  2-1/ 

Since  the  diagonal  contains  positive  and  negative  terms  P  is  a  saddle  point.  We  see 
this  more  clearly  if  we  interchange  row  2  and  row  4  and  column  2  and  column  4. 
We  obtain  the  matrix  B  below  and  det(Z?2)  <  0. 

The  presence  of  9  in  the  third  column  suggests  we  interchange  the  second  and 
third  rows  and  the  second  and  third  columns.  This  gives  the  matrix  C  below  with 
det(C2)  <  0  and  confirms  that  P  is  a  saddle  point. 


2y  2x  0  \ 
2x  2 z  2  y  j 
0  2y  2  J 


=  -2-4  <  0 
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1  9  2 


2  9  2  1 


B  = 


1 

-1  2 

0 

,c  = 

9 

4 

2 

1 

9 

2 

4 

2 

2 

2 

3 

0 

\2 

0 

2 

3/ 

u 

2 

0  -1  / 

The  standard  approach  gives  det(Ai)  =  2,  det(A2)  =  2,  det(A3)  =  — 171  and 
det(Ai)  det(A3)  <  0  implies  that  P  is  a  saddle  point. 

The  above  method  does  not  identify  absolute  or  global  maxima  and  minima.  We 
encountered  a  similar  problem  in  Example  3.3.  We  now  describe  a  useful  method 
which  can  be  applied  to  certain  functions  on  convex  open  sets.  A  subset  U  C  M/7  is 
convex  if  the  straight  line  joining  any  two  points  in  U  is  contained  in  U.  The  interior 
of  a  circle,  sphere,  box,  polygon,  the  first  quadrant  or  octant,  and  the  upper  half-plane 
are  typical  examples  of  convex  open  sets.  The  exterior  of  a  circle  or  polygon  is  not 
convex. 

Suppose  /  :  U (open,  convex)  — >  R  has  continuous  first  and  second  order 
partial  derivatives  at  all  points.  If  P,  Q  e  U  and  v  =  Q  —  P  then,  by  convexity, 
g(x)  =  f(P  +  xv)  is  defined  on  an  open  interval  in  R  and  is  the  restriction  of 
/  to  that  part  of  the  straight  line  through  P  and  Q  which  lies  in  U.  Moreover, 

9  f  d2  f  d2  f 

g'(x)  =  — (P  +  xv)  and  — T(P  +  xv)  =  g"(x).  Suppose  — «-(X)  0  for  all 

X  in  U  and  all  v  ^  0.  By  the  Intermediate  Value  Theorem  gr\x)  is  either  always 
positive  or  always  negative.  This  implies  that  g'  is  either  strictly  increasing  or  strictly 
decreasing  and  in  either  case  g  has  at  most  one  critical  point  on  every  line  in  U .  Since 
we  can  carry  out  this  analysis  for  any  pair  of  points  P  and  Q  in  U  this  shows  that 
/  has  at  most  one  critical  point  in  U.  Suppose  /  has  a  local  maximum  at  P.  If 
/OP)  <  /(G)  then  the  function  g  considered  above  must  have  a  local  minimum  at 
some  point  on  the  line  joining  P  and  Q.  This  contradicts  the  fact  that  g  has  at  most 
one  critical  point  and  shows  that  /  has  an  absolute  maximum  over  U  at  P.  Similarly 
if  f  has  a  local  minimum  at  P  then  it  has  an  absolute  minimum  over  U  at  P.  To 
92/ 

verify  that  — 3-  (X)  7^  0  when  U  is  a  convex  open  subset  in  W1  it  suffices  to  show 

9vz 

that  det (///(*  y^)  is  never  zero. 


Example  4. 7  Let 


fix,  y)  =x  +  y  -e 


_  _  ~x+y 


for  all  (x,  y)  in  M2.  We  have 


V/(x,  y)  =  (1  -  ex  -  e*+y,  1  -  e~v  -  ex+^), 


If  (x,  y)  is  a  critical  point  of  /  then  e*  =  1  —  ex+v  =  ev  and  x  =  y.  Hence 
e2x  +  e*  —  1  =  0.  If  w  =  e*  then  w2  +  w  —  1  =0  and  w  =  (—1  =L  \/5)/2.  Since 
w  =  eA  >  0  we  have  only  one  solution  w  =  (—  1  +  V5)  /2  and  /  has  just  one  critical 
point  at  P  =  ((—1  +  V5)/2,  (— 1  +  \/5)/2).  The  existence  of  a  single  critical  point 
suggests  that  we  consider  the  Hessian  at  all  points.  We  have 
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Hf(x,y) 


( 


e*  -  ex+y 

-ex+y 


—cx+y  \ 
— e-v  -  ex+y  )  ' 


Hence  det =  (e*  +  e^Xe?  +  e**?)  -  e2jc+2^  =  e2jc+^  +  e^+2*  +  ex+y 

,  92/ 

is  positive  at  all  points  (x,  y)  e  M.  Since  — ^  (P)  <  0,  /  has  a  local  and  hence  a 


global  maximum  at  P. 


dx2 


Exercises 

4. 1  Classify  the  non-degenerate  critical  points  of 

(a)  x2  +  vy  +  2x  +  2y  +  1  (f)  xyzt~xl~yl~zl 

(b)  v3  +  y3  -  3xy  (g)  xy  +  v2z  -  v2  -  y  -  z2 

(c)  v3z  -  192v  +  y2  -  yz  (h)  v2y  +  y2z  +  z2  -  8x/2v 

(d)  (2  -  x)(4  -  y)(x  +  y  -  3)  (i)  2v2y2  -  z2  +  x  -  2y  +  z 

(e)  4xyz  —  x4  —  y4  —  z4  (j)  x2  —  xy  +  yz3  —  6z. 

Show,  using  convexity,  that  the  function  in  (b)  has  an  absolute  minimum  over 
the  set  U  =  {(x,  y)  :  v  >  1/2,  j  >  1  /2}.  Show  using  the  exhaustion  method 
outlined  in  Example  3.3  that  the  function  in  (e)  has  an  absolute  maximum  and 
no  absolute  minimum  over  M3 .  Show  that  the  function  in  (f)  has  an  absolute 

maximum  and  an  absolute  minimum  over  M3 . 

2  2  2 

4.2  If  f{x,  y,  z)  =  (ax2  +  by 2  +  cz2)e-x  and  a  >  b  >  c  >  0  show  that 

the  function  has  two  local  maxima,  one  local  minimum  and  four  saddle  points. 
Find  the  maximum  and  minimum  of  /  over  M3 . 

4.3  Show  that  the  function  vyz  (v  +  y  +  z  —  1)  has  one  non-degenerate  critical  point 
and  an  infinite  set  of  degenerate  critical  points.  Show  that  the  non-degenerate 
critical  point  is  a  local  minimum. 

o  o  o 

x  +  y  +  z 

4.4  Show  that  every  critical  point  of - is  degenerate. 

vyz 

4.5  Find  the  distance  from  the  point  (—1,  1,  1)  to  the  level  set  z  =  xy. 

4.6  Fet  U  =  (fli,  b\)  x  (<22,  £>2)  x  •  •  •  x  (an,  bn)  C  W1  and  let 

M(U)  =  {/ :  U  — >  R  :  all  first-  and  second-order  partial 

92/  92/ 

derivatives  of  /  exist  and - (P)  =  - (P) 

(J  Xf  (J  X  j  (J  X  j  (J  Xf 

for  allP  g  t/and  all  i,  j ,  1  <  i,  j  <  n\. 


Show  that  M(U)  has  the  following  properties: 
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(i)  if  /,  g  e  M(U)  and  c  e  R  then  f  ±  g,  f  •  g  and  c  •  /  e  M(U)  and  if 
g  7^  0  then  f/g  e  M(U ) 

(ii)  if  /  g  M(U)  and  0:  R  — >  R  is  twice  continuously  differentiable  then 

0o/  6  M(U). 

Using  (i)  and  (ii)  show  that 


2  /  \ 

*(w,  =  sm  Ir  +  .-^  +  ij 

lies  in  M(U).  Verify  this  result  by  calculating  directly  the  appropriate  second- 
order  partial  derivatives  of  h. 

4.7  Let  Y\,  ... ,  Ym  be  m  points  in  W1.  Show  that  X/=i  \\X  —  U  II 2  achieves  its 
absolute  minimum  at  X  =  ^  X/=i  ^  •  Interpret  your  result  geometrically. 

4.8  If  z  =  0(v,  y)  satisfies  the  equation 

x2  +  2y2  +  3z2  —  2  xy  —  2yz  =  2 

find  the  points  (x,  y)  at  which  0  has  a  local  maximum  or  a  local  minimum. 

4.9  If  f  :Rn  — >  R  has  continuous  first  and  second  order  partial  derivatives  show 

that  f"(xu  . . .  ,xn)  =  i,...,xn)  =  ///(*!,...,*„)• 

4.10  Show  directly  that  the  set  F~  1  (2,  1)  in  Example  3.1  is  compact. 


Chapter  5 

Curves  in  M" 


Summary  We  introduce  and  discuss  the  concept  of  directed  curve  in  M77 .  We  obtain  a 
formula  for  the  length  of  a  curve,  prove  the  existence  of  unit  speed  parametrizations 
and  define  piecewise  smooth  curves. 

Directed  and  parametrized  curves  play  a  role  in  many  of  the  topics  discussed  in  the 
remaining  chapters  of  this  book,  e.g.  line  integrals,  existence  of  a  potential,  Stokes’ 
theorem  and  the  geometry  of  surfaces  in  M3,  and,  furthermore,  in  a  simple  fashion 
introduce  us  to  concepts  such  as  parametrizations  and  orientations  that  are  later 
developed  and  generalised  in  more  involved  settings. 

We  begin  by  giving  a  rigorous  definition  of  directed  curve.  This  may  appear 
complicated  and  unnecessarily  cumbersome  at  first  glance  and  so  we  feel  it  proper  to 
elaborate  on  why  each  condition  is  included.  It  is  always  important  in  mathematics 
to  understand  the  basic  definition  and  to  refer  to  it  until  one  appreciates  each  part 
separately  and  the  totality  of  parts  collectively.  As  progress  is  achieved  there  is  usually 
less  need  to  refer  to  the  definition  but  in  case  of  ambiguity  the  definition  is  the  book 
of  rules.  The  only  requirement  in  a  definition  is  that  it  be  consistent,  i.e.  that  the 
various  conditions  do  not  contradict  one  another  or  the  rules  of  mathematics.  Apart 
from  this  there  is  freedom  in  the  choice  of  conditions  and,  indeed,  many  books  would 
give  a  slightly  different  definition  of  directed  curve.  The  differences  depend  on  the 
degree  of  generality  sought,  the  results  aimed  at  and  the  methods  used.  However,  all 
definitions  of  curve,  directed  curve  and  parametrized  curve  contain  the  same  essential 
features. 

Definition  5.1  A  directed  (or  oriented)  curve  in  M77  is  a  quadruple  {r,  A,  B, 
v}  where  r  is  a  set  of  points  in  M77;  A  and  B  are  points  in  r,  called  respectively  the 
initial  and  final  points  of  F;  v  is  a  unit  vector  in  M77  called  the  initial  direction ,  for 
which  there  exists  a  mapping  P  :  [a,  b]  — >  M77,  called  a parametrization  of  r,  such 
that  the  following  conditions  hold: 

(a)  there  exists  an  open  interval  /,  containing  [a,  b]  and  a  mapping  from  I  into  M77 
which  has  derivatives  of  all  orders  and  which  coincides  with  P  on  [a,  b] 

(b)  P([a,  b])  =  r,  P(a)  =  A,  P(b)  =  B  and  P'(a)  =  ay  for  some  a  >  0 
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(c)  P'(t)  7^  0  for  all  t  e  [a,  b] 

(d)  P  is  injective  (i.e.  one  to  one)  on  [a,  b)  and  (a,  b]. 

Condition  (a)  is  a  rather  strong  regularity  condition — we  use  the  first  derivative  to 
define  the  tangent,  the  second  to  define  curvature,  the  third  to  define  torsion  and  the 
fourth  to  obtain  the  Frenet-Serret  equations,  and  at  this  stage  we  felt  it  was  just  as  easy 
to  assume  that  we  had  derivatives  of  all  orders.  We  also  wished  to  have  derivatives 
at  the  end  points  of  the  interval  [a,  b]  and  for  this  reason  we  assumed  that  P  has 
an  extension  as  a  smooth  function  (i.e.  as  a  function  with  derivatives  of  all  orders) 
to  an  open  interval  containing  [a,  b].  We  could  achieve  precisely  the  same  degree 
of  smoothness  by  using  one-sided  derivatives  at  a  and  b  but  felt  this  appears  even 
more  complicated.  All  definitions  of  a  curve  will  include,  as  an  essential  feature  a 
continuous  mapping  P  from  an  interval  I  in  M  onto  r .  The  degree  of  differentiability, 
whether  the  interval  I  is  open,  closed,  finite  or  infinite  may  be  regarded  as  options 
that  are  available.  We  have  chosen  the  options  that  suit  our  purposes. 

The  essential  feature  of  condition  (b),  as  we  have  just  noted,  is  P([a,  b])  =  r. 
The  remaining  parts  endow  the  set  r  with  a  sense  of  direction.  If  A  ^  B  then  the 
conditions  P(a)  =  A  and  P(b)  =  B  define  a  direction  along  r  and  in  this  case  the 
condition  P’ (a)  =  a\  is  redundant  as  v  is  determined  by  {F,  A,  B]  (Fig.  5.1). 

If,  however,  A  =  B  then  we  have  a  closed  curve ,  and  it  is  necessary  to  distinguish 
between  the  two  directions  we  may  travel  around  r .  In  the  case  of  curves  in  M2  we 
have  clockwise  and  anticlockwise  or  counterclockwise  directions.  In  W1  we  do  not 
have  such  a  concept  and  instead  specify  the  direction  along  the  curve  by  giving  an 
initial  direction  v  (Fig.  5.2). 

Note  that  when  we  know  r  and  A  then  v  =  ±P' (a)  /  \\  P' (a)  \\  and  the  condition 
a  >  0  distinguishes  between  the  two  signs  and  fixes  the  direction.  Condition  (c)  is 
necessary  to  obtain  a  unit  speed  parametrization  and  we  have  already  used  P'(a)  ^  0 
in  (b).  This  condition  excludes  curves  with  corners  but  we  get  around  this  problem 
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by  defining  piecewise  smooth  curves.  These  are  obtained  by  placing  end  to  end  a 
finite  number  of  directed  curves.  The  transition  from  directed  curves  to  piecewise 
smooth  curves  is  painless.  Later  we  will  require  P"{t)  ^  0  in  order  to  define  a  unit 
normal  to  a  directed  curve  in  M3 . 

Since  we  allow  A  =  B  it  follows  that  the  mapping  P  may  not  be  injective  on 
[a,b].  However,  we  do  not  wish  the  curve  to  cross  itself  (Fig.  5.3a)  or  to  half  cross 
itself  (Fig.  5.3b)  as  these  lead  to  unnecessary  complications  and  we  have  included 
condition  (d)  to  exclude  such  possibilities. 

A  continuous  mapping  P  :  [a,  b]  W1  which  satisfies  (a),  (c)  and  (d)  is  called  a 
parametrized  curve.  A  parametrized  curve  determines  precisely  one  directed  curve 


P([a,  b ]),  P(a ),  P(b), 


P'ia) 
P'(a)  ||/ 


for  which  it  is  a  parametrization. 

Since  the  terminology  “directed  curve”  and  the  notation  {F,  A,  B,  v}  are  rather 
cumbersome  we  will  use  the  term  curve  and  the  notation  r  in  all  cases  where  there 
is  little  danger  of  confusion.  If  A  ^  B  we  sometimes  write  {F,  A,  B}.  If  we  need  to 
use  coordinates  we  usually  let  P(t)  =  (vi  (t), . . .  xn(t))  if  F  e  W1  and,  when  n  =  3, 
we  let  P(t)  =  (v(t),  y(t),  z(t)).  It  is  helpful  to  think  of  [a,  b]  as  an  interval  of  time 
and  P(t)  as  the  position  of  a  particle  at  time  t  as  it  travels  along  the  route  F  from  A 
to  B. 

We  call  Pf(t)  the  velocity  and  ||P'(t)||  the  speed  at  time  t.  Since  distance  = 
speed  x  time  the  formula 

b 

l(D  =  J  \\P'(t)\\dt 

a 

where  / (F)  is  the  length  of  F,  is  not  surprising.  We  shall,  however,  pause  to  prove  this 
formula  in  order  to  show  the  usefulness  of  vector  notation.  Since  P  is  differentiable 
we  have  for  all  t  and  t  +  At  in  [a,  b] 


P (t  T  At)  —  F(t)  T-  P  (t)At  T-  g(t,  At)  •  At 
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a 
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Fig.  5.4 


where  g(t,At)  — >►  0asZ\t  — >  0  for  any  fixed  t.  Hence  P(t  +  At)  —  P(t)  &  P'(t)At 
for  At  close  to  zero.  If  we  partition  [a,  b]  we  get  a  corresponding  partition  of  r  and 
an  approximation  of  the  length  of  r  (Fig.  5.4). 

We  have 


kd  *  ^ii/ja!+i)-Jp(/i)n 

i 

«  Xnp/v)ii^- 

i 

b 

- >  J  \\P'(t)\\dt 

a 

as  we  take  finer  and  finer  partitions  of  [a,  b].  In  terms  of  coordinates  we  have  P(t)  = 
. . . ,  xn(t)),  P\t)  =  . . . ,  x'n(t))  and 


b  b 

Pf (t)\\dt  =  J (x[(t)^  +  •  •  •  +  x'n (0^)  ^  dt . 

a  a 


Example  5.2  Let  P(t)  =  cosd  +  sintj  +  ^k,  t  e  [0,  2tt ],  denote  a  parametrized 
curve  r  in  M3,  where  i,  j  and  k  denote  the  standard  unit  vector  basis  in  M3,  i.e. 
i  =  (1,  0,  0),  j  =  (0,  1,  0)  and  k  =  (0,  0,  1).  The  curve  r  is  part  of  a  helix — it 
spirals  around  a  vertical  cylinder  (Fig.  5.5). 

If  we  consider  only  the  first  two  coordinates,  this  amounts  to  projecting  onto  the 
R2  plane  in  M3,  we  get  the  standard  parametrization  t  — (cost,  sint)  of  the  unit 
circle.  Hence,  disregarding  the  final  coordinate,  which  we  take  to  be  the  height,  the 
particle  appears  to  move  in  a  circle.  We  have 

P\t)  =  —  sin  ti  +  cos  tj  +  k. 

The  rate  of  change  of  the  height  is  given  by  the  coefficient  of  the  k  term  and  so  the 
particle  is  rising  with  constant  speed.  We  have 
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Fig.  5.6 


and 


1  /2 

Pr(0 1|  =  ^(—  sin  t )2  +  (cos  02  +  l2)  =  V2 


It: 


l(D  =  j  V2dt  =  2n/2t r. 


0 

If  P :  [a,  /?]  — >  M/7  is  a  parametrization  of  the  directed  curve  r  we  define  the  length 
function  by  the  formula 

t 

,(«)=/  II 

a 

for  all  £  e  [a,  Z?].  If  /  =  /(F)  then  5 :  [a,  Z?]  — >  [0,  /]  and,  by  the  one-variable 
fundamental  theorem  of  calculus,  s'(t)  =  ||  P'(t )  ||  >  0.  Hence  s  is  strictly  increasing, 
s~l :  [0,  /]  — >  [a,  b]  has  derivatives  of  all  orders  on  [0,  /]  and  P  o  s~l  maps  [0,  /] 
onto  r  (Fig.  5.6).  For  the  inverse  function  s~l  we  have 


i.S~X)\t)  = 


1 


1 
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and  hence 


||(P  os_1)'(0||  = 


P/(s~1(0)ll 


P/(J~1(Q)II  =  1 
^-‘(0)11 


Since  the  remaining  conditions  (for  a  parametrization)  are  easily  checked  it  fol¬ 
lows  that  P  os~{  is  a  parametrization  of  r  and  we  have  proved  the  following  result. 

Proposition  5.3  Directed  curves  admit  unit  speed  parametrizations. 


If  two  particles  start  at  the  point  A  at  time  zero  and  both  proceed  along  r  towards 
B  at  unit  speed  then  their  positions  on  r  at  time  t  will  always  agree.  This  shows  that  a 
directed  curve  of  length  /  admits  a  unique  unit  speed  parametrization  on  [0,  /].  Using 
our  construction  of  unit  speed  curve  we  may  restate  this  as  follows:  if  P\ :  [a ,  b]  — >  T 
and  P2  :  [c,  d]  — >  r  are  any  two  parametrizations  of  the  directed  curve  r,  of  length 
/,  and 

t  t 


s\(t)  = 


P[(x)\\dx, 


si{t)  = 


P^ix) \\dx 


a 


c 


are  the  associated  length  functions  then 

P\  o  s^l(t)  =  P2  o  s^it) 


for  all  t  e  [0,  /]. 

To  include  curves  with  corners  we  extend  the  concept  of  directed  curve  to 
that  of  piecewise  smooth  directed  curve.  A  finite  collection  of  directed  curves 
{U ,  Ai ,  Bi ,  Vi}J7=1  is  called  a  piecewise  smooth  directed  curve  if 

(a)  each  {/] ,  A  / ,  Bi ,  Vi}  is  a  directed  curve, 

(b)  Bi  =  Ai+ 1  for  i  =  1, . . . ,  n  —  1  (i.e.  the  final  point  of  7~J  coincides  with  the 
initial  point  of  /]+ 1). 

If  Bn  =  A 1  we  say  that  the  piecewise  smooth  directed  curve  is  closed .  We  use 
the  notation  r  for  a  piecewise  smooth  directed  curve  and  A  and  B  for  its  initial  and 
final  points  respectively.  This  definition  is  rather  general  and  apart  from  curves  with 
corners  it  also  includes  curves  which  cross  one  another.  In  general  such  curves  do  not 
admit  a  unit  speed  parametrization  but  it  can  be  shown  that  there  exists  a  continuous 
parametrization 

n 

P:  [a,b] - >  ^  =  LU' 

i  =  \ 

and  a  partition  {ao  =  a,  a\,  02,  . . . ,  an  =  b]  of  [a,  b]  such  that  P([ai- 1,  a /])  =  TJ, 
P(ai- 1)  =  Ai,  P(at )  =  5j,  i  =  1,  ...  ,n,  P  has  derivatives  of  all  orders  on  [a,  b] 
and  P'(t)  7^  0  for  t  7^  ao,  . . . ,  an. 
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A  piecewise  smooth  directed  curve  r  =  {FJ ,  A/ ,  ,  Vi}”=1  is  studied  and  applied 

by  considering  each  of  the  component  sections  {F/,  A/,  F/,  Vi}  in  turn  (see  for 
instance  Exercise  5.6  and  our  method  of  finding  a  potential  in  Chap.  6). 


Exercises 

5.1  Find  the  length  of  the  curve  parametrized  by 

P(t)  =  (2cosh3t,  — 2sinh3t,  6t),  0  <  t  <  5. 


5.2  Show  that  the  following  parametrizations  are  unit  speed 


(a)  P\(s)  = 


(b)  P2(s)  = 


5  €  [0,  1] 

/(l+j)3/2 

V  3 


(c)  P3(s)  =  ^ (cos  \s) 


+  1,  (s  +  \A2  +  1)  l,  x/21o g(s  +  Vs2  + 
(1  —  s)3/2  s  \ 

’  3  ’  7f)’  5  e  [_1,+1] 

—  syf  1  —  s2,  1  —  s2,  0),  s  e  [0,  1]. 


5.3  Let  r  and  h  denote  positive  numbers.  Find  a  unit  speed  parametrization  of  the 
helix  P(t)  =  (r  cost,  r  sint,  ht),  0  <  t  <  6tt. 

5.4  Obtain  unit  speed  parametrizations  of  the  curves  defined  by 


(a)  t - >  (er  cos  t,  er  sin  t,  er),  t  e  [0,  1] 

(b)  t - >  (cosht,  sinht,  t),  t  e  [0,  1]. 

5.5  Parametrize  the  curve  of  intersection  of  the  sphere  x2  +  y2  +  z2  =  16  and  the 
cylinder  x2  +  (y  —  2) 2  =  4  which  lies  in  the  first  octant. 

5.6  Parametrize  the  anticlockwise  directed  triangle  in  Mr  with  vertices  (1,2), 
(—1,  —2)  and  (4,  0)  as  a  piecewise  smooth  curve. 

5.7  Find  the  closest  points  on  the  curve  x2  —  y2  =  1  to  (a,  0)  where  (i)  a  =  4 
(ii)  a  =  2  (iii)  a  =  a/2. 

5.8  Let  /  denote  a  real- valued  differentiable  function  defined  on  an  open  subset  U 

of  Mn  and  suppose  V/(X)  /  0  for  all  X  in  U.  Let  P  denote  a  parametrized 

d  , 

curve  in  U.  Use  the  chain  rule  to  show  that  — (/  o  P)(t)  =  (V/(P(t)),  P  (t)} 

dt 

and  hence  deduce,  using  the  Cauchy-Schwarz  inequality,  that  V/  (Xo)  gives  the 
direction  of  maximum  increase  of  /  at  Xo.  Show  that  ||  V/  (Xo)  ||  is  the  maximum 
rate  of  increase. 

5.9  If  T :  Mn  M11  is  a  linear  mapping  such  that  ||rX||  =  ||X||  for  all  X  e  Mn 
show  that  T  preserves  the  inner  product,  angles,  area  and  the  length  of  curves. 
When  n  =  3,  show  that  T  preserves  the  cross  product. 


Chapter  6 

Line  Integrals 


Summary  We  integrate  vector-valued  and  scalar-valued  functions  along  a  directed 
curve  in  W1 .  We  discuss  scalar  and  vector  potentials  and  define  the  curl  of  a  vector 
field  in  M3 . 

The  differential  calculus  was  developed  to  study  extremal  (i.e.  maximal  and  minimal) 
values  of  functions.  Since  it  is  only  possible  to  discuss  the  maximum  and  minimum  of 
a  real- valued  function  it  is  not  surprising  that  such  functions  occupy  a  prominent  role 
in  several- variable  differential  calculus.  However,  in  moving  to  integration  theory  it 
is  more  natural  (and  more  natural  in  mathematics  usually  means  more  useful,  more 
efficient  and  more  elegant)  to  consider  vector- valued  functions  where  the  domain 
and  the  range  share,  in  perhaps  a  loose  way,  a  common  dimension.  Formally,  we 
have  the  following  definition  of  a  vector  field. 

Definition  6.1  A  function  F  which  maps  a  subset  U  ofW1  into  W1  is  called  a  vector 
field  on  U. 

If  U  is  an  open  subset  of  Wl  and  the  vector  field  has  derivatives  of  all  orders  we 
call  it  a  smooth  vector  field  and  if  U  is  arbitrary  and  the  vector  field  is  continuous 
we  use  the  term  continuous  vector  field.  We  shall  also  use  the  notation  F  to  denote  a 
vector  field. 

The  gradient  is  an  important  example  of  a  vector  field,  i.e.  if  U  is  open  and 
/ :  U  d  M77  — >  M  is  differentiable  then  V/ :  U  W1  is  a  vector  field  on  U . 

Another  useful  example  occurs  when  T  is  a  directed  curve  in  M77  and  F  is  a 
function  which  assigns  a  vector  in  M77  to  each  point  on  T  — in  this  case  we  say  that  F 
is  a  vector  field  along  V .  For  example,  if  P  is  a  parametrization  of  a  directed  curve 
T  in  M77  then  the  mapping 


P(t)  e  r  -*  P'(t)  e  M3 
is  a  smooth  vector  field  along  T. 
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Fig.  6.1 


Vector  fields,  which  assign  vectors  to  points  in  the  domain  of  definition,  are  often 
represented  as  in  Fig.  6.1. 

This  representation  is  useful  in  locating  zeros  and  suggesting  properties  such  as 
continuity  and  smoothness.  Furthermore,  it  allows  various  physical  interpretations 
of  vector  fields,  e.g.  as  the  velocity  of  a  moving  fluid  and  the  flow  of  an  electric 
current  which  lead,  in  turn,  to  important  physical  and  engineering  applications. 

We  begin  our  study  of  integration  theory  by  defining  the  integral  of  a  vector  field 
F  along  a  directed  curve  T.  Let  P  :  [a,  b]  — >►  T  denote  a  parametrization  of  T.  To 
each  partition  of  [a,  b]  we  obtain  a  partition  of  T  (Fig.  6.2)  and  the  Riemann  sum 

•  (p^+ 1)  -  pdij)  *£F(p(t,))  ■  p'wau 

i  i 


where  At/  =  t/+i  —  t/  and  •  denotes  the  inner  product  in  M/7.  Note  that  we  are  using, 
as  usual,  the  linear  approximation  to  P(t  +  At),  P(t)  +  P'(t)  At.  If  F  is  a  continuous 
vector  field  along  T,  i.e.  if  the  mapping  t  e  [a,  b]  — >  F(P(t))  is  continuous,  then 
as  we  take  finer  and  finer  partitions  of  [a,  b]  the  Riemann  sums  converge  to  the  limit 

b 

F(P(t) )  •  P'(t)  dt. 

We  denote  this  integral  by  fr  F,  since  we  shall  shortly  prove  that  it  is  independent  of 
the  parametrization  P ,  and  call  it  the  line  integral  of  F  over  T .  In  terms  of  coordinates, 
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for  instance  in  the  case  n  =  3,  we  have 


F  =  (/,  g,  h),  P(t)  =  y(t),z(t)),  P'(t)  =  (x' (t) ,  y' (t) ,  z'(t)) 


and 


[  F=  f  [f(x(t),y(t),z(t))-x'(t)+g(x(t),y(t),z(t)]-y'(t) 
J  r  J  a 

+  h(x(t),y(t),z(t)]  -z'(t)]dt. 


This  is  frequently  written  in  the  form 


/  dx  +  g  dy  +  h  dz . 


T 


If  (T,  A,  B,  v)  is  a  directed  curve  and  f  is  obtained  by  changing  the  direction  along 
the  curve  T,i.e.  f  =  (T,  B,  A ,  — v),  then  clearly  J  F  =  —  J  F  for  any  continuous 
vector  field  F. 


Example  6.2  We  evaluate 


xydx-\-xz  dyFxyzdz 


where  the  curve  T  is  parametrized  by 


P(t)  =  (t,  t2,  t 3),  0  <  t  <  1 


In  coordinates  we  have 


x  =  x(t)  =  t 
•  y  =  y(t)  =  t 2 
z  =  z(t)  =  t3 


'  dx 

dt 

dy 

_ 

dt 

dz 

~  dt 


=  x\t)  =  1 


=  y\t)  =  It  =3> 
=  z(t)  =  3t2 


dx  =  dt 
dy  =  2^ 
r/z  =  3 12  dt 


and 


xydx+xz  dy-\-xyzdz  = 


t3  dt  T  t2  •  2t  dt  T  t^  •  3t2  dt 


JO 

rt 4  2^9 


3^9-|i 

~9~  Jo 


29 

36' 


Alternatively,  changing  to  vector  notation,  we  let  F(x,  y ,  z)  =  (xy,  xz2,  xyz ).  Then 
F(P(0)  =  (^3,  t 7,  £6)  and  Pr(0  =  (1,  2f,  3^2).  Hence 
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F(P(t))  ■  P'(t)dt  =  I  (t3 ,  t1 ,  t6)  ■  (l ,  2t ,  3t2)  dt 


[  (t3  +  2r8  +  3t8)dt  — 
In 


29 

36' 


We  return  to  the  general  situation.  If  P  :  [a,  b]  — >  T  and  Q  :  [c,  d]  ->  T  are  two 
parametrizations  of  T  with  length  functions  s  and  s\  respectively,  then,  as  we  saw  in 
Chap.  5, 

P  o  s~l  =  Q  o  Sj-1  on  [0,  /],  /  =  length  of  T. 

Using  the  one-variable  change  of  variables,  y  =  s-1(0  and  v  =  sj-1^),  we  obtain 

/ 

f(p  os~\t))  •  (Por1)^)^ 

/ 

F(P  ols,_1(t))  •  P/(ks,_1(0)(^_1)/(0 dt 
b 

F(P(y))-P\y)dy. 

Similarly 

f(Q  °  s^it))  ■  (Q  O  syy(t)  dt  =  J  F{Q{x))  ■  Q'(x)  dx. 

Since  P  o  s-1  =  Q  o  s  j-1  this  implies 


[  F(P(y))  ■  P'(y)  dy  =  f  F(Q(x ))  •  Q'(x)  dx 
J  a  J  c 

and  we  get  the  same  value  no  matter  which  parametrization  is  used.  This  justifies  the 
notation  fr  F . 

If  P  :  [a,  b]  ->  T  is  a  parametrization  of  the  directed  curve  T  in  Wl  and  t  e  [a,  b] 
let  T(t)  =  P\t)  /  ||P'(0||.  We  call  T(t)  the  unit  tangent  to  the  (directed)  curve 
at  P(t).  It  is  easily  seen  that  any  two  parametrizations  define  the  same  unit  tangent 
vector  at  each  point  of  T.  This  leads  to  another  way  of  writing  line  integrals.  If  F  is 
a  vector  field  along  T  (always  of  course  assumed  to  be  continuous)  then 


F(P(t))  •  P\t)dt 


F{P(t ))  • 


P'(t) 


P'(t) ||  dt 
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b 

F(P(t))-T(t )  || />'(?) ||  dt 
b 

(F  •  T)ds 

and  when  written  in  this  form  one  should  remember,  in  applying  a  parametrization 
P,  that  F  and  T  are  both  evaluated  at  P(t)  and  ds  =  ||  P'(t)  ||  dt. 

Real-valued  functions  or  scalar  fields,  such  as  the  speed  of  a  parametrization, 
can  also  be  defined  along  a  directed  curve  T.  Since  these  are  not  endowed  with  a 
sense  of  direction  we  cannot  apply  directly  our  definition  of  integral.  Fortunately,  we 
have  just  observed  a  special  or  privileged  direction  associated  with  each  point  on  a 
curve,  the  tangent  direction,  and  by  associating  the  continuous  real-valued  function 
/ :  T  ->  R  with  the  vector  field  fT :  T  — >  W1  we  can  define  fr  f .  The  privileged 
direction  on  an  oriented  surface  in  M3  is  the  normal  direction  and  in  this  case  it  is 
also  possible  to  consider  scalar- valued  integration  as  a  special  case  of  vector- valued 
integration. 

If  P  :  [a,  b]  — >►  T  is  a  parametrization  of  the  directed  curve  V  we  define 


[  /=  [  fT=  f  f{P(t))T(t)-T(t)\\P'(t)\\dt 
Jr  Jr  Ja 

=  f f(p(t))\\p’(t)\\dt 

J  a 


(6.1) 


Because  of  (6.1)  we  sometimes  write  fr  fids  in  place  of  fr  /. 

We  now  seek  to  identify  those  vector  fields  in  W 1  which  are  the  gradient  of  a 
scalar-valued  function.  Vector  fields  of  this  kind  are  called  conservative  and  are 
said  to  have  a  scalar  potential.  If  V/  =  F  we  call  /  a  scalar  potential  of  F.  Our 
investigation  of  this  problem  leads  to  a  generalisation  of  the  fundamental  theorem  of 
one-variable  calculus 

b 

g'(t)dt  =  gib)  -  g(a) 


where  g  is  continuous  on  [a,  b]  and  differentiable  on  (a,  b). 

We  begin  by  considering  properties  of  the  gradient  of  / :  U  C  M,?  — >  R.  If  T  is 
a  directed  curve  in  U  parametrized  by  P  :  [a,  b]  — >  W1  then 


Vf(P(t))-P'(t)dt 


J-if  O  P)(t)  dt 
dt 


(chain  rule) 


foP(t) 


b 

a 


(fundamental  theorem  of  calculus  in  R) 


f{P(b))-f{P(a)). 
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Fig.  6.3 


Thus  the  line  integral  of  V/  along  T  depends  only  on  the  values  of  /  at  the  initial 
and  final  points  of  the  curve.  This,  as  we  shall  see  in  Proposition  6.3,  characterises 
vector  fields  which  have  a  scalar  potential.  If  T  is  a  piecewise  smooth  directed  curve 
in  W1  which  is  the  union  of  directed  curves  (Ti)f=1  and  F  is  either  a  vector  field 
along  T  or  a  real-valued  function  on  T  we  let 


In  the  next  proposition  it  is  necessary  to  assume  that  any  pair  of  points  in  the  open 
set  U  can  be  joined  by  a  piecewise  smooth  directed  curve  which  lies  in  U;  an  open 
set  of  this  kind  is  said  to  be  connected . 

Proposition  6.3  Let  F :  U  C  M/z  — >  W1  denote  a  continuous  vector  field  on  the 
connected  open  subset  U  ofW1.  If  for  any  two  points  A  and  B  in  W1  and  any  two 
piecewise  smooth  directed  curves  T  i  and  T 2  joining  A  and  B  we  have 


F 


then  F  has  a  potential. 

Proof  Let  A  denote  a  fixed  point  in  U .  For  any  X  in  U  let  f(X)  =  fr  F  where  T 
is  any  piecewise  smooth  directed  curve  in  U  joining  A  to  X.  By  our  hypothesis  /  is 
well  defined,  i.e.  there  is  no  ambiguity  in  the  definition.  Let  F  =  (/j, . . . ,  fn)  and 
let  T  denote  a  curve  joining  A  to  X.  Fix  j,  1  <j<n,  and  let  Y  =  he  j  where  h 
denotes  a  real  number  close  to  0.  Let  T 1  denote  the  directed  curve  parametrized  by 
P(t)  =  X  +  tY,  0  <  t  <  1.  Then  T 1  joins  X  to  X  +  Y  and  T  U  V\  is  a  piecewise 
smooth  directed  curve  in  U  which  joins  A  to  X  +  Y  (Fig.  6.3). 
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Hence 


f(X  +  Y) 


F 


F(P(t))  ■  P'(t)dt. 


Since  P'(t )  =  Y  =  hej 


f(X  +  Y)-f(X)=  F(P(t))-hejdt 


h  [  fj(X  +  tY)dt 

Jo 

hfj(X)  +  h  M  ( X  +  tY)-fj(X )) 


dt. 


As  F  is  continuous,  each  component  in  F  and,  in  particular,  fj  is  also  continuous. 
Hence 


max 

0<r<l 


fj(XFtY)-  fj(X) 


->  0  as  /z  — >  0 


and 


j  (fj(X  +  tY)~  fj(X))dt 


)  dt 

max 

) 

0<z<  l 

fj(X  +  tY)  —  fj(X) 


We  have  shown 


f(X  +  hej)  =  f(X)  +  hfj(X)  +  g(X,  h)h 


where  g(X,  h)  — >  0  as  h 
proof. 


9/ 

0.  Hence  - (X)  =  fj(X)  and  this  completes  the 


7 


□ 


Proposition  6.3  can  be  used  to  find  potentials  but  is  not  very  practical  for  showing 
existence.  We  need  a  simpler  method  which  follows  from  the  observation: 

if  F  =  (/i ,  . . . ,  /„)  is  a  continuously  differentiable  vector  field  with  potential  /, 

9/ 

i.e.  fj  =  - —  for  all  j ,  then  for  all  i  and  j 


dx 


J 


d_(^L\=  dll 

dxj  dxi  dxj  dxi 


d 

dxi 


Bf 

dx 


The  converse  is  true  for  suitable  open  sets — the  whole  space  M/7  is  always  suitable 
and  so  also  is  any  convex  set  in  M/7.  In  Mr  an  open  set  U  is  suitable  if  and  only 
if  the  “interior”  of  any  closed  curve  in  U  also  lies  in  U\  roughly  speaking  this 
means  that  U  contains  no  holes.  In  particular  the  open  set  M2\{(0,  0)}  is  not  suitable. 


62 


6  Line  Integrals 


However,  in  M3  the  whole  space  with  a  finite  number  of  points  removed  is  suitable. 
These  examples  show  that  the  concept  of  suitability  is  rather  subtle.  The  proof  of  the 
following  result  involves  Green’s  theorem  (Chap.  9)  and  Proposition  6.3. 

Proposition  6.4  If  F  =  (f\ ,  . . . ,  fn)  is  a  continuously  differentiable  vector  field  on 
a  suitable  open  set  in  W1  then  F  has  a  potential  if  and  only  if 


(6.2) 


for  all  i  and  j. 

Example  6.5  We  wish  to  show  that  F(x,y,z)  =  (yez,  xez,  vyez)  has  a  potential. 
Let  F  =  (/i,  /2,  ff).  To  apply  Proposition  6.4  we  must  verify  (6.2),  that  is  we  must 
show 

9/i  _  9/2  9/l  —  2L  2L  —  2L 

dy  dx  ’  dz  dx  dz  dy 

We  have 

9/1  z  9/2  9/!  z  9/3  9/2  z  9/3 

- =  e  =  - ,  - =  ye  =  - ,  -  =  xe  =  - 

dy  dx  dz  dx  dz  dy 


and  hence  F  has  a  potential  on  M3 . 

The  following  is  probably  the  simplest  way  to  find  a  potential.  If  /  is  a  potential 
for  F  then 


V 

dx 


(6.3) 


Hence 


yez  dx  =  xyez  +  </>(y,  z) 


where  <p  is  the  constant  of  integration  with  respect  to  x  which  may,  however,  depend 
on  y  and  z.  Differentiating  with  respect  to  y  we  get 


9/  z  ^  90 

=  ve  + 


dy 


dy 


9/ 

and  comparing  this  with  the  formula  for  —  in  (6.3)  we  have 

dy 


Z  90 

xe  H - =  xe 

dy 


and 


dfi 


=  o. 


dy 
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Fig.  6.4 


0  ,y,z) 


Hence  <fi  does  not  depend  on  y  and  we  let  (p(y,  z)  =  Now  differentiating 

f(x,  y,  z)  =  Xje2  +  1 j/{z) 
and  comparing  this  with  (6.3)  gives 

9/  Z  ,  /  //  x  Z 

—  =  xye  +  i/s  (z)  =  xyt 
dz 

and  x/s'iz)  =  0.  This  implies  that  x/z  is  a  constant  and  we  have  shown 

f(x,y,z )  =xyez  +c 


for  some  constant  c. 

We  can  also  use  Proposition  6.3  to  find  a  potential  /.  This  proposition  tells  us 
that 

f(x,y,z )  =  J  F 

where  T  is  any  piecewise  smooth  directed  curve  in  M3,  joining  a  fixed  point  to 
(x,  y,  z),  is  a  potential  of  F.  We  take  the  fixed  point  to  be  the  origin  in  M3  and  a 
piecewise  smooth  curve  T  consisting  of  three  straight  lines  parallel  to  the  axis  T  i , 
T 2,  T3.  Specifically  we  use  the  following: 

Ti  joins  (0,  0,  0)  to  (x,  0,  0),  P\(t)  =  (t,  0,  0),  P[{t)  =  (1,  0,  0),  0  <  t  <  v 

r2  joins  (x,  0,  0)  to  (x,  y,  0),  P2(0  =  ( x ,  t,  0),  P^t)  =  (0,  1,0),  0  <  t  <  y 

T 3  joins  (v ,  y ,  0)  to  (v ,  y ,  z) ,  Pi(t)  =  (x,y,t),  F3'(0  =  (0,0,  1),  0  <  t  <  z 

(see  Fig.  6.4). 
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Then 


fix,  y,  z)  = 


F+  I  F+  I  F 

Ti  Jr2  J  r3 

fl(Pl(t))dt+[  f2(P2(t))dt+  [  f3(P3(t))dt 
'o  Jo  Jo 

*x  py  pz 

fi(t,0,0)dt+  f2(x,t,0)dt+  f3(x,y,t)dt 

o  Jo  Jo 

py  pz 

=  I  0  •  dt  +  /  ve°  dt  +  /  xye*  dt 


ro 


nz 


=  0  +  xy  +  [xy  -er]0 
=  x_y  +  xyez  —  xy 


=  xyQ^ . 


We  verify  this  result  by  noting 

9  9  9 

—  (Xyez)  =  yez  =  fu  —(Xyez)=xez  =  f2,  —(xyez)=xyez  =  f3. 
ox  dy  dz 

We  now  define  the  cross-product  of  two  vectors  in  M3 .  This  allows  us  to  present  the 
classical  notation  for  what  are  essentially  vector  derivatives.  We  shall  also  need  this 
product  when  we  discuss  the  Frenet-Serret  equations  in  Chap.  7.  If  v  =  (vi,  V2,  V3) 
and  w  =  (wi ,  W2 ,  W3)  are  two  vectors  in  M3  then  the  cross  product  of  v  and  w,  v  x  w, 
is  defined  as 


V  X  w  = 


i  j  k 

Vi  V2  V3 
W 1  W2  W3 

=  (V2W3  —  V3VV2)i  —  (Vi  W3  —  V3Wi)j  +  (viW2  —  V2Wi)k 
=  (V2W3  —  V3W2,  —  V 1  W3  +  V3W1,  V1W2  —  V2W1). 


Since  V/  =  ( 


9  9  9 


dx  ’  9y  ’  dz 

vector  field,  by  the  formula 


)(/)  we  define  cur  1(F)  or  V  x  F,  F  =  (flt  f2,  /3>  a 


•  • 

1  J 

9  9 

dx  dy 

fi  h 


k 

9 

dz 

h 


,9/3  9/2,  ,9/3  9/j,  ,9/2  9/l  V 

9y  dz  9v  dz  ^  9v  dy 


One  easily  sees  that  curl (V/)  =  0  for  any  real-valued  function  with  continuous  first- 
and  second-order  partial  derivatives  while  Proposition  6.4  says  the  converse  is  true 
on  suitable  domains,  that  is 


6  Line  Integrals 


65 


F  =  V/  <<=>►  curl(F)  =  0. 


If  F  is  the  velocity  of  a  fluid  then  curl  (F)  measures  the  tendency  of  the  fluid  to 
curl  or  rotate  about  an  axis.  Curl(F)  gives  the  direction  of  the  axis  of  rotation  and 
||curl(F)  ||  measures  the  speed  of  rotation. 

Since  the  symbolism  Vxf  has  proved  useful  we  consider  the  analogous  symbol 
V  •  F  where  the  dot  replaces  the  cross  product.  This  makes  sense  for  a  vector  field 
on  M".  If  F  =  (/i,  f2,  ■  ■  ■ ,  fn ),  we  let 


V  •  F  =  ( 


a 


a 


n 


dx\ 


dx 


-)  •  (fu  ••  fn)  =  ^ 
n  ■  ! 


i  =  l 


dfi_ 

dxi 


div(F). 


This  is  called  the  divergence  of  F  and  written  div(.F)  (see  Chap.  15).  A  vector  field 
G  is  a  vector  potential  for  the  vector  field  F  on  the  open  subset  U  of  M3  if  on  U  we 
have 

curl(G)  =  V  x  G  =  F. 


On  “suitable”  open  sets  a  vector  field  F  has  a  vector  potential  if  and  only  if  V  •  F  = 
div(F)  =  0. 

Example  6.6  We  show  that  F(X)  =  X/||X||3  has  a  vector  potential  on 

M3\{  the  z-axis}.  Let 

^  (yz,-xz,  0)  z  1  ,  m 

G(x,y,z)  {x2  +  y2){x2  +  y2+zl)l/2  ||X||2  -  Z2  '  ||X|| (y’  ’0) 

on  M3\{  the  z -axis}  =  M3\{(v,y,z)  :  x  =  y  =  0}.  We  calculate  curl(G). 
To  simplify  our  calculations  we  use  symmetry  and  the  following  result  which  follows 
immediately  from  Exercise  1.10: 


We  have 


l 


curl  (G)  =  (gi,g2,g3)  = 


a 

dx 


yz 

(■ x 2  +  y2) \\x 


j  k 

a  a 

d  y  dz 

—xz 

(x2  +  y2)  II X 


0 
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which  implies 


9I 

(  xz  \ 

x  a 

'  z  i 

dz 

k(x2  +  X2)  II X  ||  > 

x2  +  y2  dz  ' 

^  n*n  / 

x  imi) 

x2  +  y2  ||X||2 
x  (||X||2-z2) 
x2  +  y2  \\X\\3 


\\X\\3’ 

since  x2  +  y2  +  z2  =  ||X||2.  By  symmetry  g2  =  y  /  11^ II3-  Finally 


£3  = 


9 1 

<  —XZ  ) 

1  9  1 

<  yz  ) 

9*' 

^(x2  +  x2)  II  x  ||  J 

1  ay' 

^(x2  +  X2)  II X  ||  / 

(x2  +  y2)||X||  —  2x2||X||  — 


=  —Z 


(x2  +  y2)2||X||2 


(x2  +  y2)||X||  -2y2||X||  -  y2(;y} 


+ 


-z(-l) 


(x2  +  X2)2  II X  ||2 


z 


\\X\[ 


11*11 


and  we  have  shown 


curl(G)  = 


(x,y,z) 


x 


(. x 2  +  y2  +  £2)3/2  ||X|| 


=  F. 


Exercises 

6. 1  Evaluate  fr  F  where  F  is  a  vector  field  and  P  is  a  parametrization  of  the  directed 
curve  T 

(a)  F(x,  y,  z)  =  (x,  y,  z),  P(t)  =  (sinL  cos£,  t),  0  <  t  <  2n , 

(b)  F(x,  y,  z)  =  x2  dx  +  xyzdy  +  xzdz ,  P(0  =  0,  t2,  t 3),  0  <  £  <  1, 

(c)  F(v,  y,  z)  =  coszi  +  e*j  +  e^k,  F(0  =  (1,  t ,  er),  0  <  f  <  4. 

6.2  Find,  if  they  exist,  scalar  potentials  for  the  following  vector  fields 

(a)  F(x,  y,  z)  =  (2x  +  zexz ,  z,  y  +  xezx), 

(b)  G(x,  y,  z)  =  (y,  z  cos  yz,  y  cos  yz), 

(c)  H(x,y,z)  =  (y  +  yzcos(vyz),  x  +  vz  cos(vyz),  2z  +  vy  cos(vyz)), 

(d)  K(x,  y,  z)  =  (2vcos(v2  +  yz),  zcos(v2  +  yz),  y  cos(v2  +  yz)). 
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6.3  Let  f(x,y,z)  =  x2y2  +  y2z2 .  Verify  directly  that  V  x  V/ 

6.4  Compute  the  curl  of  each  of  the  following  vector  fields: 

(3,1,2) 


=  0. 


(a)  F i(x,  y,  z )  = 

(b)  F2(x,  y,  z )  = 


||X|| 

(yz,  zx,  xy) 


X 


(c)  F3(X)  = 


{X,X)X 


||X||4 

where  X  =  (x,  y,  z)  e  R3  \  {0}. 

6.5  If  a,  b,  c  and  d  are  vectors  in  R3  show  that 


(a)  abxc=caxb= 


<21 

02 

<23 

b\ 

b2 

b3 

c\ 

C2 

C3 

~  (a 

b)c 

c 

a  d 

c 

b  d 

• 

(c)  (a  x  b)  •  (c  x  d)  = 

6.6  Let/,  g :  M3  —>  R  and  F,  G:  M3  — >  M3  denote  smooth  functions.  Prove 

(a)  div(/F)  =  /div(F)  +  V/  •  F 

(b)  div(F  x  G)  =  curl(F)  G  -  F  curl(G) 

(c)  curl(/F)  =  V/  x  F  +  /curl(F). 

6.7  If  / :  U  (open)  cl3^I  has  continuous  second-order  derivatives  show  that 


d2  f  d2f  d2f 

dlv(V/)  =  7?  +  ^  +  77 


2  ' 


Symbolically  the  left-hand  side  has  the  form  V  •  V/  and  is  written  (for  this 
reason)  V2/.  (V2/  is  called  the  Laplacian  of  /  and  if  V2/  =  0  then  /  is 
called  harmonic). 


6.8  Show  that 


is  harmonic  on  M3  \  {0,  0,  0}. 


(x2  +  y2  +  z2)3/2 

6.9  If  /:  M+  —>  R  and  g(X)  =  f(\\X\\)  for  X  eM?  \  {0,  0,  0}  show  that 


V^(X)  = /'(||X||) 


X 


and  V*g(X)  =  f"(\\X\\)  +  ———f'(\\X 


X 


X 


Show  that  g  is  harmonic  on  M3  \  {0,  0,  0}  if  and  only  if 


A 


fir)  =  -  +  B 
r 


for  all  r  7^  0  in  R. 


Chapter  7 

The  Frenet-Serret  Equations 


Summary  We  discuss  curvature  and  torsion  of  directed  curves  and  derive  the  Frenet- 
Serret  equations.  Vector-valued  differentiation  and  orthonormal  bases  are  the  main 
tools  used. 

In  this  chapter  we  define  geometric  concepts  associated  with  a  directed  curve  and 
derive  a  set  of  equations — the  Frenet-Serret  equations — which  capture  the  funda¬ 
mental  relationships  between  them. 

We  begin  with  directed  curves  in  Mr  since  this  particular  case  exhibits  special 
features  not  present  in  higher  dimensions.  These  are  due  to  considering  a  one- 
dimensional  object  (the  directed  curve)  in  two-dimensional  space,  M2.  The  same 
phenomena  appear  in  Chap.  12  when  we  examine  a  two-dimensional  object  (an  ori¬ 
ented  surface)  in  three-dimensional  space,  M3,  and  the  same  underlying  principles 
are  present  when  we  introduce  torsion  later  in  this  chapter.  Moreover,  our  motivation 
and  interpretation  of  normal  curvature  (Chap.  16)  and  geodesic  curvature  (Chap.  18) 
are  based  on  our  study  of  curves  in  M2.  This  special  straightforward  case  deserves 
particular  attention  because  of  the  insight  it  provides  into  later  developments. 

Let  P  :  [a,  b]  ->  M2  denote  a  unit  speed  parametrization  of  the  directed  curve  T 
and  let  P(t)  =  (x(t),  y(0)  for  all  t  in  [a,  b].  At  P(t)  e  T  the  unit  tangent,  T (, t ),  is 
given  by 

T(t)  =  P\t)  =  (x'(t),y\t)). 

The  special  features,  mentioned  above,  imply  that  there  are  just  two  unit  vectors  in  M2 
perpendicular  to  T(t)  and  using  the  anticlockwise  (or  counterclockwise)  orientation 
of  M2  we  can  distinguish  between  them.  If  we  rotate  T(t)  through  +7r/2  in  an 
anticlockwise  direction  we  obtain  a  unit  vector  on  the  left-hand  side  of  the  direction 
of  motion  along  T  (Fig.  7.1).  We  call  this  unit  vector  the  unit  normal  to  T  at  P(t) 
and  denote  it  by  N(t).  In  coordinates 

N(t)  -  (-/(O,  V(0). 

We  have  ( T{t),T{t ))  =  1  and  differentiating  we  get,  by  the  product  rule, 
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Fig.  7.1 


T{t))  =  0  =  ( T\t ),  r(f)>  +  (T(t),  T\t)). 
at 

Since 

(T'it),  Tit))  =  (Tit),  T'it)) 


this  implies 


(T'it),  Tit))  =  0. 


Hence  T' it)  is  perpendicular  to  Tit)  and,  using  once  more  the  fact  that  M2  is  two- 
dimensional,  we  see  that  T'it)  is  parallel  to  Nit).  The  curvature  of  T  at  Pit)  is 
defined  as  the  unique  scalar,  Kit),  satisfying 


T\t)  =  ic(t)N(t). 


(7.1) 


In  terms  of  coordinates 

Kit)  =  (Kit)Nit),  Nit))  =  (T\t),  Nit)) 

=  y"(t))  ■  (~y'(t),x'(t)) 

=  y\t)x\t)  -  x"(t)y\t)  (7.2) 

for  all  t  e  [a,  b].  We  call  |/c(t)|  the  absolute  curvature  of  T  at  Pit)  and  note  that 

Wit)  I  =  lir'wil  =  11^(011.  (7.3) 

Example  7.1  Let  P:  [a,  b\  — >  M2  denote  an  arbitrary  parametrization  of  the 
directed  curve  T.  We  recall  from  Chap.  5  that  P  o  s~l :  [0,/]  — >►  T  is  a  unit  speed 
parametrization  of  T  where  /  is  the  length  of  T  and  s  is  the  length  function.  If 
Pit)  =  (x(t),  y(t)),  t  e  [a,  b],  then 


Pos  lit)  =  (xos  lit),yos  lit )),  t  g  [0,  /] 
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and,  moreover, 


1 


i 

^'(j-i(o)2  +  y(s_I(o)2) 1 


We  have 

{x  O  s_1)'(0  =  x'(s-1(0)  •  (s_1)'(r) 


and 

(xos~lY'(t)  =  x"(s_1(f))((s_1),(0)2  +  x(s~l(t))  •  ( s~lY'(t ) 

and  analogous  formulae  for  (y  o  s~1)' (t)  and  (y  o  Substituting  these  into 

(7.2)  and  simplifying  we  obtain  the  curvature  at  the  point  P(t),  t  e  [a,  b], 

m  =  (7.4) 

{[x\t)f  +  (/(0)2) 

If  r  is  the  graph  of  a  smooth  function  / :  [a,  b]  — >  R  directed  from  left  to  right 
then  P{t )  =  (t,  fit)),  t  e  [a,  b],  is  a  parametrization  of  V.  This  parametrization 
is  only  unit  speed  in  the  trivial  case  of  a  horizontal  line  (why?).  Since  x(t)  =  t  we 
have  x'(t)  =  1  and  x"(t)  =  0  and  as  y(t)  =  f(t)  we  obtain  y'(t)  =  f'(t )  and 
y"(t)  =  f"  it).  Hence  the  curvature  at  (V,  fit))  is 


f'it) 

/  \  3/2  ’ 

(i  +  fit)2) 


We  now  discuss  the  geometric  significance  of  curvature.  Let  P  denote  a  unit  speed 
parametrization  of  the  directed  curve  T.  For  simplicity  we  suppose  0  e  [a,  b],  the 
domain  of  definition  of  P .  If  t  is  close  to  zero  then 


p(o  =  p(  o)  +  p'm  +  p"(  o)E  +  g{t)t2 


(7.5) 


where  g(t)  — >  0  as  t  — >  0.  Since  P'(0)  =  T(0)  and  P"(0)  =  7^(0)  =  k(0)N(0) 
we  can  rewrite  this  as 


Pit)  =  Pi  0)  +  Ti0)t  +  K(0)W(0)y  +  g(t)t2. 


The  function 

t2 

Q(t)  =  P(0)  +  T(0)t  +  k(0)N(0)  — 


(7.6) 
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(a) 


(b) 


* - 1 - ► 


<?(o)  2h  i 


N 


Q( 0)  t  T 


Fig.  7.2 


is  the  best  quadratic  approximation  to  T  near  P  (0)  and  the  curve,  parametrized  by 
Q ,  has  the  same  tangent,  the  same  normal  and  the  same  curvature  as  T  at  P(0). 
By  translating  and  rotating  the  plane,  if  necessary,  we  can  suppose  P(0)  =  (0,  0), 
T( 0)  =  (1,0)  and  N( 0)  =  (0,  1).  This  implies  Q{t)  =  (t,  ic(0)t2/2)  and,  if  k(0)  / 
0,  then  one  of  the  two  situations  portrayed  in  Fig.  7.2  holds. 

Let  Ct  denote  the  circle  with  centre  ( at ,  bt)  which  passes  through  the  points 
Q(—t),  2(0),  Q(t).  By  symmetry  ( at ,  bt)  lies  on  the  y-axis,  hence  at  =  0  and  \bt\ 
is  the  radius  of  Ct.  As  t  tends  to  zero  the  circles  Ct  converge  to  a  circle  C  with  centre 
(0,  b)  and  radius  \b\.  This  is  the  circle  of  curvature  at  0(0)  to  the  directed  curve 
parametrized  by  Q.  Since  _  0  as  *  — >  0  it  can  easily  be  shown  that  C 

is  also  the  circle  of  curvature  to  V  at  P(0),  i.e.  the  circle  that  fits  closest  to  V  near 
P(0).  The  centre  (0,  b)  is  called  the  centre  of  curvature  of  T  at  P(0)  and  \b\  is  the 
radius  of  curvature. 


We  have 


bj  =  11(0,  bt)  -  {t,  K(0)r2/2)||2  =  t2  +  0 bt  -  K(0)t2 /2)2 . 


Hence 


b2  =  t2  +  b2  -  b,K(0)t2  +  ic(0)2t4/4 


and 


Letting  t  tend  to  zero  we  get  btc( 0)  =  1.  We  interpret  |  at  (0)  |  as 


1 


1 


\b\  radius  of  circle  of  curvature 


Since  the  sign  of  b  tells  us  on  which  side  of  T  the  circle  of  curvature  lies  and  bK  (0)  =  1 
we  have 
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Fig.  7.3 


K(0)  >  0  -<=>•  the  circle  of  curvature  and  the  normal  are 

on  the  same  side  of  Y  (Fig. 7. 2a) 
m  <  0  «<=>>  the  circle  of  curvature  and  the  normal  lie 

on  opposite  sides  of  r(Fig.7.2b). 

If  /c(0)  =  0  then  Y  is  rather  flat  near  P( 0)  and  the  circle  of  curvature  has  infinite 
radius  and  thus  is  a  straight  line — in  our  case  the  v-axis. 

We  have  thus  established  a  geometric  interpretation  for  both  absolute  curvature 
and  the  sign  of  curvature  in  M2  and  this  often  yields  immediate  and  useful  information. 
For  example,  consider  the  ellipse  in  Fig.  7.3,  oriented  in  an  anticlockwise  direction. 
The  normal  will  always  point  into  the  ellipse  and  is  called,  for  this  reason,  the  inner 
normal .  Since  the  circle  of  curvature  at  any  point  has  finite  radius  and  lies  on  the 
same  side  of  the  tangent  line  as  the  curve  the  curvature  is  always  strictly  positive. 

At  the  points  A  and  C  the  circles  of  closest  fit  to  the  ellipse  have  minimum  radii 
among  all  points  on  the  ellipse.  Hence  we  have  maximum  curvature  at  A  and  C  and, 
similarly,  minimum  curvature  at  B  and  D. 

Now  consider  a  directed  curve  Y  in  W1,  n  >  2,  with  unit  speed  parametrization 
P :  [ a,b ]  — >  T.  As  before  the  unit  tangent  to  Y  at  P(t)  is  P'(t)  =  T(t).  We 
cannot,  however,  define  the  unit  normal  to  Y  at  P(t)  by  rotating  T  (t)  since  there 
are  not  just  two  but  an  infinite  number  of  sides  to  Y  and  thus  an  infinite  number 
of  ways  of  choosing  a  unit  vector  perpendicular  to  T(t).  Even  if  there  were  only 
two  directions  we  would  require  some  concept  of  anticlockwise  direction  in  W:  to 
distinguish  between  them. 

To  define  a  normal  we  need  to  choose  a  direction  perpendicular  to  T  which  is 
associated  in  some  way  to  the  curve  Y.  We  still  have  (T(t),T(t))  =  1  for  all  t 
in  [a,  b ]  and  differentiating,  as  we  did  previously,  we  get  ( T'(t ),  T(t))  =  0.  We 
have  found  a  special  vector,  T'(t ),  perpendicular  to  T(t )  and  if  this  is  non-zero  (or 
equivalently  if  P"(t)  ^  0)  we  define  the  unit  normal ,  N(t),  by 

T\t) 


N(t)  = 


T'(t)  || 
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We  define  the  curvature  ic(t)  by  K(t)  =  ||r'(f)||.  Thus  the  normal  is  only  defined  at 
points  of  non-zero  curvature  and  at  such  points  we  obtain  the  equation 

T\t)  =K(t)N(t).  (7.10 

Note  that  (7.10  and  (7.1)  are  the  same  equation.  However,  (7.1)  applies  to  a  curve 
in  Mr  while  (7.10,  which  is  the  first  of  the  Frenet-Serret  equations  when  n  =  3, 
applies  to  a  curve  in  Mn .  The  definitions  of  curvature  and  normal  are  different  in 
these  two  equations.  The  technique  of  using  an  equation  in  a  simple  setting  to  extend 
a  definition  to  a  more  general  setting  is  standard  and  useful  in  mathematics. 

Now  that  we  have  defined  k  and  N  in  Mn  we  must  investigate  their  properties 
as  they  could  well  be  different  to  those  in  R2.  Using  the  same  terminology  is  an 
expression  of  our  aspirations  but  does  not  qualify  as  a  proof.  We  note  first  that 
curvature  in  Mn  is  always  defined  and  always  non-negative  but  certain  curves,  such 
as  straight  lines,  do  not  have  a  normal. 

Equation  (7.5)  and  the  approximation  (7.6)  are  still  valid  for  curves  in  Mn  and 
the  same  analysis  shows  that  K(t )  can  be  interpreted  as  the  reciprocal  of  the  radius 
of  the  circle  of  curvature  to  T  at  P(t).  Hence  curvature  in  W\  n  >  2,  has  the  same 
geometrical  interpretation  as  absolute  curvature  in  M2.  We  have  seen  that  the  sign 
of  curvature  in  M2  was  related  to  the  different  sides  of  a  curve  and,  in  view  of  our 
previous  remarks,  it  is  not  surprising  that  it  does  not  feature  when  n  >  2. 

From  now  on  we  restrict  ourselves  to  curves  in  M3 .  The  approximation 

t2 

Q(t)  =  P(0)  +  T  (0)t  +  k(0)N  (0)  y 

near  0  e  [a ,  b]  is  still  valid  and  shows  that  the  plane  (or  two-dimensional  subspace)  in 
M3  closest  to  the  curve  near  P(t)  is  the  plane  through  P  ft)  spanned  by  {T(t),  N(t)}. 
We  call  this  the  osculating  plane  of  the  curve  at  P  (t) .  We  may  consider  the  osculating 
plane  as  the  two-dimensional  analogue  of  the  tangent  line.  As  we  move  along  the 
curve  the  osculating  plane  will  generally  change  and  the  more  it  changes  the  more 
twisted  the  curve.  We  measure  this  by  defining  a  new  concept — torsion — which  we 
denote  by  r .  We  define  torsion  and  a  third  unit  vector,  the  binormal  in  M3 ,  in  a  fashion 
similar  to  the  way  we  introduced  curvature  and  the  normal  for  curves  in  M2.  At  each 
point  on  the  directed  curve  T  in  M3  we  have  obtained  two  perpendicular  unit  vectors 
T  and  N  and  these  span  a  two-dimensional  subspace  of  M3 .  Hence  there  are  precisely 
two  unit  vectors  perpendicular  to  T  and  N .  To  choose  one  of  these  unambiguously 
we  require  a  sense  of  direction  or  orientation  in  M3 .  This  will  also  be  important  in 
integration  theory.  The  basis  of  this  sense  of  direction  is  known  as  “the  right-hand 
rule”  and  we  describe  it  in  the  special  case  in  which  we  are  interested.  Use  the  right 
thumb  as  the  vector  T  and  the  first  finger  in  place  of  N .  Then  the  second  finger  will, 
when  put  perpendicular  to  T  and  N ,  give  the  direction  of  the  binormal ,  B  (Fig.  7.4). 

Think  of  T  and  N  as  determining  the  flat  plane  of  this  page.  This  page  has  two 
sides  and  thus  two  unit  vectors  perpendicular  to  it.  Since  the  vector  N  is  obtained  in 
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A  N 


T 

► 


Fig.  7.4 


Fig.  7.4  by  rotating  T  in  an  anticlockwise  direction,  B  will  be  the  unit  vector  on  this 
side  of  the  page.  Note  that  we  placed  T  before  N  in  this  construction. 

Mathematically,  we  can  find  the  binormal  by  using  the  cross  product  in  M3 .  For 
a  directed  curve  T  in  M3  with  parametrization  P ,  unit  tangent  T  (t)  and  unit  normal 
N(t )  the  binormal  at  the  point  P  it)  is  given  by 

B(t )  =  T(t)  x  N(0- 


To  derive  further  results  we  list  standard  properties  of  the  cross  product — all  of  which 
follow  easily  from  well-known  results  about  determinants. 

Let  v,  w  and  u  be  vectors  in  M3  then 


(a)  v  x  w  =  — w  x  v 

(b)  vxw/0  -<=>  v  and  w  are  linearly  independent 

(c)  v  x  w  is  perpendicular  to  both  v  and  w 

(d)  || v  x  w ||  =  || v ||  •  || w ||  •  |  sin0|,  where  0  is  the  angle  between  v  and  w 

(e) 


and 


U\ 

U2 

U  3 

u  •  V  X  w  = 

Vi 

V2 

V3 

W 1 

W2 

W3 

uvxw  =  wuxv  =  v-  wxu 


(f)  |  u  •  v  x  w  ||  =  volume  of  the  parallelepiped  with  adjacent  sides  v,  w  and  u 

(g)  hl(v-u)  x  (w  —  u)  ||  =  area  of  triangle  with  vertices  v,  w  and  u. 

By  (c),  B(t)  is  perpendicular  to  both  T (t)  and  N(t)  and,  by  (d), 

115(011  =  \\T(t)W  .  ||  TV  (Oil  ■  I  sin(7T/2)|  (sinceJXO  JL  N(t)) 

=  1. 


Hence  {T(t),  N(t),  B(t)}  consists  of  three  mutually  perpendicular  unit  vectors  in 
M3 .  In  particular,  they  are  linearly  independent  and  so  form  a  basis  for  M3 .  We  call 
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Fig.  7.6 

them  an  orthonormal  basis  (“ortho”  comes  from  orthogonal  or  perpendicular  and 
normal  comes  from  the  fact  that  they  are  unit  vectors).  Another  orthonormal  basis 
for  M3  is  the  set  {i,  j,  k}.  We  consider  {T  it),  Nit),  Bit)}  as  a  special  basis  which  is 
adapted  to  studying  the  curve  T  near  Pit).  As  t  varies  the  basis  {T  it),  N  it),  Bit)} 
changes  and  is  called  a  moving  frame  along  the  curve  (see  Fig.  7.5). 

Since  N  (t)  x  Bit)  is  easily  seen  to  be  a  unit  vector  perpendicular  to  both  N  and 
B  we  must  have 

N(t)  x  B{t)  =  ±T(t). 


Hence 


Tf)  •  Nf)  x  Bf)  =  ±Tit)  •  Tf)  =  ±1 

II  by  (e) 

Bit)  ■  T(t )  x  N{t )  =  Bit)  ■  Bit )  =  1 

and  N  it)  x  Bf)  =  T  it).  Similarly  Bf)xTit)  =  N  it)  and,  using  (a),  the  remaining 
cross  products  involving  T ,  N  and  B  can  be  found. 

The  simplest  way  to  remember  these  is  to  use  the  diagram  shown  in  Fig.  7.6.  The 
cross  product  of  any  two  taken  in  an  anticlockwise  direction  is  the  one  that  follows 
it,  e.g.  N  x  B  =  T .  If  we  work  in  a  clockwise  direction  we  get,  from  (a),  the  negative 
of  the  following  one,  e.g.  N  x  T  =  —  B. 

We  are  now  in  a  position  to  make  effective  use  of  the  orthonormal  basis  {T,  N,  B}. 
Since  it  is  a  basis  any  vector  v  can  be  written  in  the  form 


y  =  olT  +  /3N  +  yB 
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for  some  real  numbers  a ,  /3  and  y.  If  we  take  the  inner  product  of  both  sides  with 
respect  to  T  then 


(y,  T)  =  a(T ,  T)  +  fi{N,  T)  +  y(B,T)=a 

1  0  0 


Similarly  /3  =  (v,  N)  and  y  =  (v,  B)  and  thus 

v  =  (v,  T)T  +  (v,  N)N  +  (v,  B)B .  (7.7) 

We  also  note,  although  we  do  not  require  it  here,  that  Pythagoras’  Theorem  implies 

II v||2  =  (\,T)2  +  (\,N)2  +  (\,  B)2. 

We  use  vector- valued  differentiation  to  find  Since  | Bit) |  =  1  we  have 

( B(t ),  Bit))  =  1  and  hence 


0  =  ^-(B(t),  Bit))  =  ( B\t ),  Bit))  +  (Bit),  B\t)) 
at 

=  2 (B' it),  Bit)). 


(7.8) 


Since  (Bit),  T it))  =  0  we  get,  in  the  same  way, 


d 

dt 


(Bit), Tit)) 


=  ( B\t ),  Tit))  +  (Bit),  T\t)) 

=  (B\t),  Tit))  +  (Bit),  Kit)Nit))  (by  (7.1')) 
=  (B' it),  T it))  +  Kit) (Bit),  Nit)) 

=  ( B'it ),  T{t))  (since#  _L  N). 


Replacing  v  by  B'it)  in  (7.7)  and  using  (7.8)  and  (7.9)  we  have 

B'it)  =  (B\t),  Nit)) Nit) 


(7.9) 


i.e.  B'it)  is  parallel  to  N it).  We  use  this  equation  to  define  the  torsion  of  T  at  Pit  ), 
tit),  by  letting1 

Tit)  ^ -(B'it),  Nit)) 


and  obtain  another  of  the  Frenet-Serret  equations 

B'it)  =  -r  it)  Nit).  (7.10) 


i 


By  introducing  one  minus  sign  here  we  avoid  many  more  minus  signs  later. 
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We  discuss  the  geometric  significance  of  torsion  in  Chap.  8  but  note  at  this  point 
that  B(t)  is  normal  to  the  osculating  plane  and  hence  torsion  is  a  measure  of  the  rate 
of  change  of  the  osculating  plane. 

We  have  found  T '  and  B'  in  Eqs.  (7.1/)  and  (7.10)  for  a  directed  curve  in  M3  and 
the  remaining  Frenet-Serret  equation  is  an  expression  for  Nf  in  terms  of  the  basis 
{T,  N,  B}.  To  find  N!  we  differentiate  the  equation  N  =  B  x  T  using  the  product 
rule.  We  have 

Nf  =  B'  xT  +  B  xTf  =  -xN  x  T  +  B  x  kN  =  rT  x  N  —  kN  x  B. 

Since  T  x  N  =  B  and  N  x  B  =  T  we  obtain  for  all  t  the  equation 

N\t)  =  —ic(t)T(t)  +  x(t)B(t).  (7.11) 

Equations  (7.E),  (7.10)  and  (7.1 1)  which  express  T\  N!  and  Bf  in  terms  of  T ,  N  and 
B  are  known  as  the  Frenet-Serret  equations  and  contain,  for  all  practical  purposes, 
complete  information  on  the  curve  (see  also  Example  8.5).  The  set  [T,  N,  B,  k,  x] 
is  known  as  the  Frenet-Serret  apparatus  of  the  curve  T.  It  is  important  to  remember 
that  whenever  we  discuss  normals  to  curves  in  M3  or  the  Frenet-Serret  equations  for 
a  curve  we  assuming  the  curvature  is  strictly  positive. 

The  Frenet-Serret  equations  are  easily  remembered  when  expressed  in  matrix 
form 


Certain  classical  curves  are  not  covered  by  Definition  5.1  and  we  encounter  one  of 
these — the  helix — in  our  next  example.  In  its  natural  state  the  helix  does  not  have  an 
initial  or  final  point  and  is  usually  parametrized  over  R.  The  reader  should  have  little 
difficulty  analysing  such  curves  by  the  methods  we  have  already  developed.  Essen¬ 
tially  one  examines  different  finite  parts  of  the  curve  in  turn — for  example  consider 
how  we  treated  a  part  of  the  helix  in  Example  5.2.  For  the  sake  of  completeness, 
however,  we  mention  how  our  definition  of  parametrized  curve  can  be  extended  in 
the  following  natural  way  to  include  such  curves: 

a  continuous  mapping  P  :  I  — >  ,  I  an  interval  in  R,  is  a  parametrized  curve 
if  for  every  closed  interval  [a,  b]  C  I  the  restriction  of  P  to  [a,  b ]  satisfies  (a),  (c) 
and  ( d )  of  Definition  5.1. 

Example  7.2  Let  T  denote  the  helix  parametrized  by 


P(t)  ~  (r  cos  cot,  r  sin  cot,  hoot),  — oo  <  t  <  +oo 
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where  co  =  (r2  +  h 2)  1//2  and  r,  /t  and  are  all  positive.  Since 


P' it)  =  ( — r  co  sin  cot ,  r co  cos  cot ,  hco) 


and 


||P'(OII  =  (r2oj2  sin2  cot  +  r2oo2  cos2  cot  +  h2co2)1^2 
=  ( r2co 2  +  h2co2)l/2  =  coir2  +  h2)l/2  =  1 

the  parametrization  is  unit  speed  and  T  it)  =  P' it).  We  have 


Pn  it)  =  T'(t)  ~  (—rcoz  coscot,  —rcoz  sin  cot,  0) 


and 


K(t)  =  ||r'(OII  =  (r2co4  cos2  cot  +  r2co4  sin2  cot)1^2  =  co2r. 


Note  that  Y  is  not  a  circle  but  has  constant  curvature.  Hence  ic(t)  >  0  and 


N(t)  = 


T\t) 


1 


Tf(t)  ||  co2r 
=  (—  cos  cot ,  —  sin  cot ,  0) 


9  9 

(— r<z>  cos  cot,— r  co  sin<ztf,0) 


We  have 


B{t)  =  r(f)  x  N(f)  = 


i  j  k 

— r co  sin  cot  rco  cos  cot  hco 

—  cos  cot  —  sin  cot  0 


=  (hco  sin  cot,  —hco  cos  cot ,  r<z>) 


and 


/  /  7  • 

5  (7)  =  (/z&>  coscot,  hco  sin  cot,  0), 


Since  r(t)  =  —  (Bf(t),  N(t))  this  implies 


r  it)  =  —[ihoo2  coscot,  hco2  sin  cot,  0),  (—  cos  cot,  —  sin  cot,  0) 


9  9  9*9  9 

=  hco  cos  cot  +  hco  sin  cot  =  hco  . 


We  have  calculated  the  Frenet-Serret  apparatus  for  the  helix.  In  doing  so  we  used 
two  of  the  Frenet-Serret  equations.  The  third  equation 


N'  =  -kT  +  tB 
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may  be  used  to  check  our  calculations.  The  Frenet-Serret  apparatus  was  found  in 
the  following  sequence:  first  check  that  P  is  unit  speed  then 


{  T(t)  , 

K(t) 

N(t) 

B(t) 

x(t)  J 

P'(t) 

unoii 

T'(t)/K(t ) 

T (/)  x  N(t) 

Other  sequences  are  also  possible  but  the  above  appear  to  be  generally  more  direct 
(for  unit  speed  curves). 


Exercises 


7. 1  Parametrize  the  curve  x 2  +  (  v /3)2  =  9  with  an  anticlockwise  orientation  and 
hence  find  its  curvature.  Find  the  points  where  the  curvature  is  a  maximum 

(a)  by  inspecting  a  sketch; 

(b)  by  differentiating  the  curvature  function; 

(c)  by  inspection  of  the  curvature  function. 

7.2  Let  / :  U  (open)  c  M2  — >  M  and  suppose  /-1( 0)  has  full  rank  at  each  point. 
If  /-1  (0)  can  be  parametrized  as  a  directed  curve  in  Mr  show  that  its  absolute 
curvature  equals 


(ft  +  /v2)3/2 


Using  this  result  find  the  curvature  at  all  points  on  the  ellipse  ( x/a )2  + 
( y/b )2  =  1  directed  so  that  the  normal  points  outwards.  Verify  your  answer 
when  a  =  3  and  b  =  9  using  Exercise  7.1. 

7.3  Let  T  denote  the  plane  curve  parametrized  by 

P(t)  =  (t,  log  cos  t),  —7t/4<t<7t/4. 


Show  that  T  has  curvature  —  cos  t  at  P(t). 

7.4  Show  that  a  directed  curve  in  M3  is  a  straight  line  if  and  only  if  all  its  tangent 
lines  are  parallel. 

7.5  Show  that  each  of  the  following  gives  a  unit  speed  parametrization  of  a  curve 
T  in  M3.  Calculate  the  Frenet-Serret  apparatus  of  the  curve  and  verify  that 
N'  =  —kT  +  xB. 


(a)  P(t)  = 

(b)  P(t)  = 

(c)  Pit)  = 


/(l  +03/2  (l  -03/2  t  \ 

t(cos-1(0  -ty/i  -t2,  1  -  t2,  o),  0  <t  <  1/2 
/(1+f2)1/2  2 1  log(r  +  vT+T2)^ 

\—r  ’Ji’ - 7? - )’  teR- 
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7.6  Let  P  ( t )  =  ( a  cos  t,  a  sinL  at  tan  a)  denote  a  parametrization  of  a  helix  where 
0  <  cos  a  <  a  <  1 .  Find  a  unit  speed  parametrization.  Show  that  the  centre  of 
curvature  also  moves  on  a  helix  and  find  the  cylinder  on  which  this  helix  lies. 

7.7  If  a  and  b  are  positive  real  numbers  show  that  the  curve  parametrized  by 


P(t)  = 


^ a  cos  t,  a  sin  t,  b  cosh 


t  e  R 


lies  on  the  cylinder  x2  +  y2  =  a2.  Show  that  the  osculating  plane  at  any  point 
on  the  curve  makes  a  constant  angle  with  the  tangent  plane  to  the  cylinder  at 
that  point. 

7.8  If  P  :  [a,  b]  — >►  V  is  a  unit  speed  parametrized  curve  show  that 

(. P '  X  P ",  P'")  =  k2x 


and  if  r  7^  0  show  that 

^  _  (P'  x  P\  Pm) 
r  ~  Z p ",  p ") 


7.9  A  plane  in  ]R3  (or  a  cross-section  of  M3)  consists  of  all  points  (x,y,z)  satisfying 
a  linear  equation  ax  +  by  +  cz  =  d  where  at  least  one  of  a,  b,  c  is  non-zero. 
Find  A  e  M3  and  a  e  R  such  that  the  plane  above  coincides  with 

{A  g  M3;  A  •  X  =  o'}. 

Find  a  when  the  plane  goes  through  the  origin. 

7.10  Let  P  denote  a  unit  speed  parametrization  of  a  directed  curve  in  M3  with  non¬ 
zero  curvature  at  F(0).  Show  that  the  equation  of  the  osculating  plane  at  P(0) 
is 

{X  g  M3  :  (X  -  P( 0))  x  P\ 0)  •  P"( 0)  =  0}. 

7.1 1  Let  P :  [0,  /]  — >►  M3  denote  a  unit  speed  parametrization  of  a  directed  curve  T, 
with  positive  curvature  and  torsion.  For  t  e  [0,  /]  let 

Q(t )  =  f  B(x)dx. 

Jo 

Show  that  Q  defines  a  unit  speed  parametrization  of  a  directed  curve  f.  If 
{T,  N,  B,  k,  r }  is  the  Frenet-Serret  apparatus  for  T  show  that  {B,  —N,  T ,  r,  k] 
is  the  Frenet-Serret  apparatus  for  f1 . 

7.12  Let  T  denote  a  directed  curve  in  M3  with  positive  curvature  at  all  points  and 

suppose  P  :  [a,  b]  — >  M3  is  a  unit  speed  parametrization  of  T.  Using  the 
Frenet-Serret  equations  and  the  identity  axbc  =  cxab  find  a  mapping 
X  :  [a,  b]  — >  M3  such  that  T'(t )  =  X(0  x  N'(t )  =  X(0  x  A(0  and 

=  X(r)  x  B(t)  for  all  t  e  [a,  b]. 


Chapter  8 

Geometry  of  Curves  in  M3 


Summary  We  apply  the  Frenet-Serret  equations  to  study  the  geometric  significance 
of  torsion,  to  analyse  curves  in  spheres  and  to  characterise  generalised  helices. 

We  first  provide  a  geometrical  interpretation  of  zero  torsion. 

Proposition  8.1  If  T  is  a  directed  curve  in  M3  with  positive  curvature  at  all  points 
then  the  following  are  equivalent 

(a)  r  is  a  plane  curve. 

(b)  the  function  t  —>  B(t)  is  constant. 

(c)  r(t)  =  Ofor  all  t. 

Proof  Since  ic(t)  >  0,  N(t)  is  defined.  By  the  Frenet-Serret  equations  B'(t )  = 
—r(t)N(t)  and  since  ||A/X0II  =  1  we  have: 

B(t )  is  independent  of  t  <<=>►  B'(t)  =  0  <<=>>  r(t)  =  0. 


Hence  (b)  and  (c)  are  equivalent.  Now  suppose  f  is  a  plane  curve,  i.e.  there 
exists  a  plane  in  M3  which  contains  r.  Then  there  exists  a  unit  vector  A  in  M3  and 
a  real  number  c  such  that  r  C  {X  g  M3  :  X  •  A  =  c}.  Let  P  denote  a  unit  speed 
parametrization  of  r  with  domain  [a,  b].  For  al  It  e  [a,  b].  Pit)  •  A  =  c.  We  have 


d 

dt 


i Pit )  •  A)  =  T {t)  •  A  =  0 


and 


d2 

dt2 


P{t)  •  A  =  T\t)  •  A  =  ic(t)N(t)  A  =  0. 


Hence  A  is  perpendicular  to  both  T  (t)  and  N(t)  and  B(t)  =  zb  A  for  all  t.  Suppose 
B(t\)  =  A  and  B(tz)  =  —A.  The  mapping  g  :  t  — >  B(t)  •  A  is  a  continuous  real¬ 
valued  function  on  [a,  b],  g(t)  =  ±1  for  all  t,  g(t\)  =  1  and  gfe)  =  —  1.  This 
is  impossible,  since  it  would  imply,  by  the  Intermediate  Value  Theorem,  that  there 
exists  to  e  ( a ,  b)  such  that  g(to)  =  0.  Hence  g(t)  =  +1  for  all  t  or  g(t)  =  —1  for 
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all  t  and  this  implies  B(t )  =  A  for  all  t  or  B(t)  =  —A  for  all  t .  In  either  case  B  is  a 
constant  function  and  (a)  =>►  (b). 

Now  suppose  (b)  holds.  Let  P  denote  a  unit  speed  parametrization  of  r  with 
domain  [a,  b].  If  to  £  (<z,  b),  Xo  =  P(to)  and  Bit)  =  B  for  all  t,  then 

^(P(t)  -  X0,  B)  =  ( P\t ),  B{t))  +  (P(t)  -  X0,  0) 
at 

=  (Tit),  Bit)) +0 
=  0  (since  Tit)  _L  Bit)). 

Hence  there  exists  a  constant  c  such  that 


(Pit)  ~  Xq,  B)  =  c 


and  P  lies  in  the  plane  through  Xo  perpendicular  to  B.  This  shows  that  (b)  =>►  (a) 
and  completes  the  proof.  □ 

Proposition  8.1  gives  a  precise  geometric  interpretation  of  zero  torsion.  To  inter¬ 
pret  non-zero  torsion  we  look  at  an  expansion  of  the  parametrization  P  about  a  fixed 
point  relative  to  the  basis  {T ito) ,  N ito) ,  B ito)} .  For  convenience  we  may  suppose 
to  =  0. 

In  Chap.  7  we  obtained,  using  orthogonality,  the  Taylor  series  expansion  and  the 
Frenet-Serret  equations,  the  following  three  expansions  of  Pit): 


Pit)  =  (Pit),  T(0))T(0)  +  (P(t),  Ni0))Ni0)  +  (P(t),  Bi0))Bi0)  (8.1) 

t2 

=  P(  0)  +  P'm  +  P"(  0)-  +  g(t)t 2  (8.2) 

k(0)N(0)  ,  , 

=  P(0)  +  T(0)t  +  ;2  t 2  +  g(t)t2  (8.3) 


where  git)  0  in  M3  as  t  0.  From  (8.3)  we  can  identify  the  main  influence — the 
first  non-constant  term  in  the  Taylor  series  expansion — on  the  shape  of  the  curve  in 
the  T (0)  and  N( 0)  directions.  Comparing  (8. 1-8.3)  we  see  also  that  the  first  possi¬ 


ble  non-zero  term  in  the  Z?(0)  direction  will  be  — ~(P(t),  Bi 0)) 

dt 5 


.  By  repeated 

t= o 


differentiation  and  use  of  the  Frenet-Serret  equation  for  T!  we  obtain 


(P\t),Bi0))  =  (: 7\f),  5(0)) 

(P\t),  Bi 0))  =  (T\t),  Bi 0))  =  (Kit)Nit),  Bi 0)) 


and,  using  the  Frenet-Serret  equation  for  N\ 

( Pmit ),  BiO))  =  (K\t)Nit)+Kit)N'it),  BiO)) 

=  (K'it)Nit)  +  K(t)(—K(t)T(t)  +  x it) Bit)),  BiO)). 
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Letting  t  =  0  we  get 

(P"'( 0),  5(0))  =  k'(0)(N(0),  5(0))  -  k2(0)(T(0),  5(0)) 

+  k(0)t(0)<5(0),5(0)> 

=  k(())z(0) 

since  {T,  N,  B]  are  mutually  perpendicular  unit  vectors.  This  gives  us  the  approxi¬ 
mation 

t 2  t3 

Q(t )  =  5(0)  +  5(0)f  +  k(0)N(0)-  +  k(0)t(0)5(0)- 

2  6 

called  the  Frenet  approximation  to  the  curve  r  at  0.  The  Frenet  approximation  is 
clearly  a  refinement  of  (8.3)  which  takes  account  of  torsion.  The  function  t  ->  Q(t) 
defines  a  parametrized  curve  which  has  the  same  Frenet-Serret  apparatus  as  the 
original  curve  at  P  (0) .  From  the  Frenet  approximation  we  see  the  influence  of  non¬ 
zero  torsion  on  the  shape  of  the  curve.  Torsion  controls  the  motion  of  the  curve 
orthogonal  to  the  osculating  plane.  If  r(0)  >  0  then  the  curve  twists  towards  the 
side  of  the  osculating  plane  which  contains  Z?(0)  and  the  greater  r(0)  the  more 
dramatic  the  twist.  If  r(0)  <0  the  curve  twists  towards  —  Z?(0). 

An  everyday  example  of  non-zero  torsion  is  given  by  the  curve  on  the  edge  of  a 
screw.  In  tightening  a  screw  one  usually  uses  the  right-hand  and  follows  the  right- 
hand  rule  while  in  loosening  a  screw  one  follows  the  left-hand  rule.  If  you  have 
any  doubts  about  the  difference  change  hands.  This  also  illustrates  the  two  different 
orientations  of  M3 . 

Example  8.2  In  this  example  we  study  a  directed  curve  r  which  lies  in  a  sphere 
with  centre  c  and  radius  r.  Let  P  denote  a  unit  speed  parametrization  of  r .  Our 
hypothesis  states  that 

II 5(0  -  e||2  =  (5(f)  -  c,  5(f)  -  c)  =  r2. 

Consider  the  expansion  of  5(f)  —  c  relative  to  the  orthonormal  basis  {5(f), 
N(t),  5(f)},  i.e. 

5(f)  -c=  (5(f)  -  c,  T (f))5(f)  +  (5(f)  -  c,  N{t))N(t) 

+  (5(f)  —  c,  5(f))5(f).  (8.4) 

Differentiating  we  get 

-j-(5(f)  -  c,  5(f)  -  c)  =  0  =  2(5(f)  -  c,  P'(t)} . 
at 

Since  5  is  unit  speed,  P'(t )  =  5 (f),  and  we  may  restate  this  as  follows: 


(5(f) -c,  5(f))  =  0. 


(8.5) 
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Differentiating  again  and  using  the  Frenet-Serret  equation  for  T'  gives  us 

0  =  -  c,  T(t ))  =  (T(t),  T(t ))  +  (P(t)  -  c,  T\t )> 

at 


Hence  K(t )  ^  0  for  all  t,  N(t)  is  defined  and 


<^(0 


c,  N (t)) 


1 

K(t)' 


(8.6) 


Differentiating  (8.6)  and  applying  the  Frenet-Serret  equation  for  N!  we  obtain 

=  (Tit),  Nit))  +  (Pit) -c,  N'(t )) 

=  (P(t)  -  c ,  - K(t)T(t )  +  r (t)B(t))  (since  N  L  T) 

=  T (t)(P(t)-c,  Bit))  (by  (8.5)). 


If  r(fo)  7^  0  for  some  /q  then,  by  continuity,  rd)  0  for  all  t  near  to  and 


(P(t)-c,  B(t)) 


Substituting  (8. 5-8. 7)  into  (8.4)  we  get 


Pit)-c  =  - 


(B.7) 


and  we  have  found  Pit)  in  terms  of  the  Frenet-Serret  apparatus  of  r.  By  Pythagoras’ 
theorem 

r2  =  ||P(f)  -  c||2  =  +  ( (— V  .  J_)  . 

Kit)2  \G(r)/  x  it)) 

Hence  we  have  recovered  the  radius  of  the  sphere  from  the  curvature  and  torsion 
when  we  know  that  the  curve  lies  in  a  sphere.  In  particular,  we  see  that  r2  >  l  MO2, 
or  K{t)  >  1/r,  which  we  may  loosely  rephrase  as  saying  that  a  curve  in  a  sphere  is 
at  least  as  curved  as  the  sphere  in  which  it  lies. 

If  r  (t)  =0  for  all  t  in  an  open  interval  I  then  the  above  implies  that  k  (t )  is  constant 
on  /.  Since  a  plane  circle  has  constant  curvature,  Proposition  8.1  and  Example  8.5 
imply  that  the  part  of  r  parametrized  by  restricting  P  to  I  is  part  of  a  circle  contained 
in  a  plane  in  M3 . 

Example  8.3  The  helix  in  Example  7.2  satisfies  {Tp,  (0,  0,  1))  =  hco  for  all  p  where 
Tp  is  the  unit  tangent  at  the  point  p.  We  generalise  this  by  defining  a  generalised 
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helix  in  M3 ,  as  a  curve  7  of  positive  curvature  (at  all  points)  for  which  there  exists  a 
unit  vector  u  in  M3  such  that 


{Tp,  u)  =  c  (constant) 

for  all  p  6  r.  Let  P  denote  a  unit  speed  parametrization  of  the  generalised  helix 
r  and  let  {7(0,  N(t),  B (t) ,  k (t) ,  r  (t)}  denote  the  Frenet-Serret  apparatus  at  P(t). 
We  prove  the  following  characterisation: 

r(t) 

r  is  a  generalised  helix  - is  constant  (i.e.  independent  of  t). 

K{t) 

Since  (7(0,  u)  is  constant  it  follows,  by  the  Cauchy-Schwarz  inequality 
(Example  3.4),  that 

|(r(0,u)|  <  117(011  ■  ||u||  <  1 

and  (7(0,  u)  =  cos  6  for  some  0.  If  6  =  rut  then,  by  the  equality  case  in  the 
Cauchy-Schwarz  inequality,  7  (0  =  ±u.  By  the  Intermediate  Value  Theorem  (see 
the  proof  of  Proposition  8.1)  this  implies  that  7(0  is  a  constant  function  of  t.  Hence 
T'(t)  =  0  and  this  contradicts  our  hypothesis.  We  thus  have  (7 (0,  u)  =  cos  6  for 
some  6  /  rut.  Again  using  the  orthonormal  basis  {7(0,  N(t),  B(t)}  we  have 

u  =  (u,  7 (0)7(0  +  (u,  N(t))N(t)  +  (u,  B(t))B(t). 

Since  (7(0,  u)  =  cos  6 

jt{(T(t ),  u>)  =  0  =  (T'(t),  u)  +  (T(t),  (u)'}  =  (K(t)N(t),  u) 

(since  u  is  a  constant,  ( u )'  =  0).  By  our  hypothesis  /c(0  7^  0  and  this  implies 
(u,  N(t))  =  0.  Hence 

u  =  cos  ^7(0  +  sin  0B(t). 

A  further  application  of  the  Frenet-Serret  equations  implies 

0  =  — u  =  cos  0T'(t)  +  smOB'(t) 
dt 

=  cos6K(t)N(t)  —  sin6r(t)N(t) 

=  (cos^/c(0  —  sin0r(O)A/XO- 
Since  ||V(0ll  =  1  this  implies  cos$/c(0  =  sin0r(O  and 

r(0  cos  0 

=  cot  0 . 


/c(0  sin^ 
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We  have  shown  x  (t)/ic(t)  is  constant  for  any  generalised  helix.  In  obtaining  this 
result  we  obtained  a  formula  for  u  and  now  use  this  to  prove  the  converse.  Let  r 
denote  a  directed  curve  with  non-zero  curvature  in  M3  such  that  r  it) /  Kit)  is  constant 
(i.e.  independent  of  t).  This  hypothesis  implies  that  there  exists  a  real  number  9, 
0  <  0  <7 r,  such  that  cotO  =  r (t)/ic(t)  for  all  t  (note  that  0  does  not  depend  on  t). 
Let 

u (t)  =  cos  6  T(t)  +  sin  6  Bit). 


By  Pythagoras’  theorem  ||u||2  =  cos2  6  +  sin2  0  =  1  and  u  is  a  unit  vector.  To 

show  that  u  does  not  depend  on  t  we  prove  —  (u (f))  =  0.  By  the  Frenet-Serret 

dt 

equations 


—  (u (t))  =  cos 0  T'(t )  +  sinO  Br{t) 
d  t 

=  (a :(t)  cos  9  —  t  (t)  sin  9)N(t)  =  0. 

Hence  u  does  not  depend  on  t  and  so  is  a  constant.  Moreover,  since  T  _L  B, 

( T (t),  u)  =  (T (t),  cos 0  T (t)  +  sin#  B(t))  =  cos 9. 

This  shows  that  r  is  a  generalised  helix  and  justifies  our  claim. 

Our  analysis  so  far  applies  to  unit  speed  parametrizations  of  a  directed  curve. 
Unfortunately,  many  natural  parametrizations  of  curves  are  not  unit  speed.  It  is  thus 
useful  to  be  able  to  calculate  the  Frenet-Serret  apparatus  directly  from  an  arbitrary 
parametrization. 

Let  P  :  [a ,  b]  — >  M3  denote  an  arbitrary  parametrization  of  the  directed  curve 
r.  We  suppose  P'(t)  and  P"{t)  are  both  non-zero  for  all  t.  Let  s  :  [a,  b]  — >  [0,  /] 
denote  the  length  function  associated  with  P  (see  Chap.  5).  Then  /  is  the  length  of 
r,  \\P'(t)\\  =  s'(t )  and  Q  :=  P  o  s~l  is  a  unit  speed  parametrization  of  r.  Let 
{T(t),  N(t ),  B(t),  K(t),  r(t)}  denote  the  Frenet-Serret  apparatus  at  the  point  P(t) 
on  r.  We  have 

Q  o  s(t)  =  P  o  s~]  o  s(t)  =  P{t). 

Hence 

=  Q'{s(t))s'(t )  =  P\t)  =  \\P\t)\\T{t) 
and 

P'(t ) 


G'(5(0)  =  r(0 


P'COII 


(B.8) 
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Differentiating  again 

|rK«»)  =  s(e'<s(,»  /(<)) 

=  Q"{s{t))(s\t))2  +  Q\s{t))s'\t)  =  P"{t). 
Since  Q  has  unit  speed  the  Frenet-Serret  equations  imply 

Q"(s{t))=K{t)N{t) 

and 

P"(t)  =  (s'{t))2K(t)N{t)+s"{t)T(t). 


Hence 


P'{t)  x  P"(t)  =  s'(t)T{t)  x  (s'{tj1K{t)N(t)+s"(t)T{t)\ 


since  T  x  N  =  B  and  T  x  T  =  0.  Since  ||  B(t)  ||  =  1  and  s'(t)  = 
implies 


P'(t)xP"(t)  11 

ll^'WII3 


P'(t)\\  this 
(8.9) 


and 


P\t)  x  P"(t) 

p'(t)xp"(t)  ir 


(8.10) 


The  simplest  way  to  obtain  N  is  to  use  the  formula 


N  =  B  xT. 


(8.11) 


The  Frenet-Serret  equation  for  Nf  implies 

=  ( KN)\t )  =  ic'(t)N(t)+K(t)N'(t) 

=  Kr(t)N(t)  +  K(t){— K(t)T  (t)  +  r(t)B(t)) 


and  hence 


{Q"'(s(t)),B(t))  =K(t)r(t). 
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On  the  other  hand 
£(<*"»)  =  '■"» 

=  Q'"(s(t))s'(t)3  +  3  Q”(s{t))s'{t)s"{t)  +  Q'(s(t))sm(t) 
=  Q"'(s(t))s'(t)3  +  lK(t)s\t)s"{t)N(t)  +  s"'(t)T  ( t ). 


By  orthogonality 

I  ...  P'  (t)  X  P"(t)  \  ...  .  n  ...  ,  v 

[P ||p4)x/4)I|)  =  {P  (r)’  S(0>  =  s  «  <2  S(f)) 

=  ||P'(0llVM0. 


Finally 


(P"'(0,  P'(0  X  P"(r)> 

p'(0xP"(0IH|P'(0II3^(0 


(P"'(0,  P'(0  X  p"(0> 

||P'(0  x  p"(0ll2 


(8.12) 


Equations  (8.8-8.12)  are  the  Frenet-Serret  apparatus  at  P(t)  for  r  in  terms  of 
the  parametrization  P . 

Example  8.4  Calculate  the  Frenet-Serret  apparatus  of  the  curve  parametrized  by 


P(t)  =  (t  —  cos  t,  sin  t,  t). 


We  first  calculate  the  required  derivatives  of  P;  P',  P"  and  P,n .  We  have  P'(f)  = 
(1  +  sin  t,  cos  t,  1),  P"(t)  =  (cos  t,  —  sin  t,  0)  and  P"'(t)  =  (—  sin  t,  —  cos  t,  0). 
Next,  we  obtain  the  cross  product 


P\t)  x  P\t)  = 


l 

1  +  sin  t 
cos  t 


j  k 

cos  t  1 
—  sin  t  0 


(sin  t,  cos  t,  —  sin  t  —  1) 


and  finally  the  norms  or  lengths 

||P/(7)||  =  (1  +  2sinf  +  sin2  t  +  cos2  t  +  1 )^2  =  (3  +  2  sin  t)1^2 
||  P'(t)  x  Prr{t)  ||  =  (sin2  t  +  cos2  t  +  sin2  t  +  2  sin  t  +  1)  ^2 

=  (2  +  2  sin  t  +  sin2  t )^2. 


Hence 


P\t)  (1  +  sinf,  cosf,  1) 
P'(0ll  _  (3  +  2  sin t) 1/2 


Tit)  = 
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K(t) 

B{t) 
*  it) 


\\P'it)  x  P"it)  ||  (2  +  2sinf  +  sin2  f)1/2 

||P'(OII3  “  (3  +  2sinf)3/2 

Pf(t)  x  Pff(t)  (sinG  cos  t,  —  sint  —  1) 

||  5'(0  x  5"(f)||  (2  +  2sinf  +  sin2  f)1/2 

(P"'(0,  5'(0  X  p"(0> 

||5'(0  x  p"(OII2 

(—  sin  t,  —  cos  t,  0)  •  (sin  G  cos  t,  —  sin  t  —  1) 

2  +  2  sin  f  +  sin2  t 

-1 

2  +  2  sin  t  +  sin2  t 


Nit)  =  5(0  x  Tit) 


1 


a/3  +  2  sin  */2  +  2  sin  f  +  sin2  f 
(2  cos  f  +  sin  f  cos  G  —  1  —  3  sin  t  —  sinz  t,  —  cos  t) 


1  J 

sin  t  cos  f 
1  +  sin  t  cos  f 

2 


(6  +  10  sin  t  +  7  sin2  f  +  2  sin3  t) l/2 


k 

—  sin  t  —  1 
1 


Example  8.5  In  Example  8.2  we  showed  that  curvature  and  torsion  together  allowed 
us  to  deduce  properties  of  spherical  curves.  In  this  example  we  show  that  curvature 
and  torsion  completely  determine  the  shape  of  a  curve  in  M3 .  Let  T  and  T\  denote  two 
directed  curves  having  the  same  length  /  in  M3 .  We  suppose  that  both  have  positive 
curvature.  Now  transfer  E\  so  that  its  initial  point  coincides  with  the  initial  point  of 
r  and  rotate  it  so  that  the  tangents,  normals  and  binormals  of  T  and  T\  coincide  at 
the  initial  point.  These  operations  do  not  affect  the  shape  of  T\ .  Let  P  :  [0,  /]  — >►  T 
and  P\  :  [0,  /]  — >►  T\  denote  unit  speed  parametrizations.  We  now  suppose  that  the 
curvature  and  torsion  of  T  and  T\  at  P(t)  and  P\it)  coincide  for  all  t  and  thus  we 
have  the  Lrenet-Serret  apparatus  {T,  N,  B,  k,  x]  and  {T\ ,  N\ ,  B\,  k,  x]  for  T  and 
A  respectively.  Using  the  dot  product  in  M3  we  define  g:  [0,  /]  — >  R  by 


g(t)  =  T(t )  •  T\(t)  +  N(t )  •  N i(r)  +  B(f)  •  B\(t). 


By  our  hypothesis  g  (0)  =  3  and  by  the  Cauchy-Schwarz  inequality  (Example  3 .4) 
—3  <  git)  <  3  for  all  t  and  git)  =  3  if  and  only  if  T it)  =  T\it),  Nit)  =  N\(t)  and 
5(f)  =  5i(f).  Lrom  the  Lrenet-Serret  equations 

gf  =  kN  -T\+kT  •  2Vi  +  ( — fcT  +  t5)- Afi  +Af-(— /cri  +  r5i)  —  rA^-5i  —  r5-iVi  =  0. 


Hence  g  is  a  constant  mapping  and,  since  g(0)  =  3,  we  have  git)  =  3  for  all  t 
and  the  Lrenet-Serret  apparatus  is  the  same  for  both  curves.  In  particular 


(5  -  Px)\t)  =  P\t)  -  P[it)  =  Tit)  -  Tiit)  =  0 


92 


8  Geometry  of  Curves  in  M3 

and  P{t)  =  P\{t)  +  C  for  all  t.  Since  P{ 0)  =  A(0)  this  implies  P(t)  =  P\(t)  for 
all  t  and  one  curve  lies  on  top  of  the  other.  We  conclude  that  r  and  A  have  the  same 
shape. 


Exercises 


8.1  Show  that  each  of  the  directed  curves  in  Exercise  7.5  is  a  generalised  helix.  In 
each  case  find  a  unit  vector  u  such  that  (T(t),  u)  is  independent  of  t . 

8.2  Let  P{t)  =  (£,1  +  t~l,  t~l  —  t),  1  <  t  <  2,  denote  a  parametrization  of  the 
curve  r  in  M3.  Show  that  B{t)  =  (l/\/3,  —  1/V3,  l/x/3)  for  all  f  and  hence 
deduce  that  F  lies  in  the  plane  v  —  y  +  z  =  —  1 .  Find  the  Frenet-Serret  apparatus 
for  r. 

8.3  If  r  is  parametrized  by 


P{0)  =  (log  cos  0,  log  sin  0,  V20),  —<0<  — 


show  that  r  has  curvature  sin  20 />/2  at  P(0). 

8.4  For  the  curve  parametrized  by  P(t)  =  (3 12,  3t  —  t3 ,  3t  +  f3),-i  <  t  <  1,  show 
that 

1 

Kit)  =  —  Tit)  =  - z-^r. 

3(1  +  t2)2 


Find  a  unit  vector  u  such  that  (T(t),  u)  is  independent  of  t. 

8.5  Find  the  curvature  and  torsion  of  the  curve  parametrized  by 


P(t)  =  (er  cosG  er  sin  t,  er),  t  e  R. 


8.6  The  plane  through  a  point  on  a  curve  perpendicular  to  the  tangent  line  is  called 
the  normal  plane  to  the  curve  at  the  point.  Show  that  a  curve  lies  on  a  sphere  if 
the  intersection  of  all  normal  planes  is  non-empty.  Hence  show  that  the  curve 
parametrized  by 

P{0)  =  (—  cos  20,  —2  cos  0,  sin  20),  0  e  [0,  2jr ] 

lies  in  a  sphere.  Find  the  centre  and  radius  of  the  sphere. 

8.7  Show  that  the  curve  parametrized  by  P(t)  =  {at,  bt 2 ,  t3),  ab  ^  0,  is  a  gener¬ 
alised  helix  if  and  only  if  4 b4  =  9a2. 


Chapter  9 

Double  Integration 


Summary  We  define  the  double  integral  of  a  function  over  an  open  subset  of  R2  and 
use  Fubini  ’s  theorem  to  evaluate  such  integrals.  We  discuss  the  fundamental  theorem 
of  calculus  in  M2 — Green  s  theorem. 

We  discuss  ( double )  integration  of  a  real- valued  function  of  two  variables  f(x,y ) 
over  an  open  set  Q  in  Mr.  Motivated  by  the  one-dimensional  theory  we  divide  Q 
into  rectangles — the  natural  analogue  of  intervals — by  first  drawing  horizontal  and 
vertical  lines  and  thus  partitioning  the  v-  and  y-axes. 

Let  Xi  denote  a  typical  element  of  the  partition  of  the  the  v-axis  and  let  yj  be  a 
typical  element  on  the  y-axis.  The  resulting  grid  of  rectangles  gives  a  partition  of  C2 
(Fig.  9.1)  and  we  form  the  Riemann  sum 

y.  y,  f(xi,yj )  •  Axi  ■  Ay j,  Axi  =  xi+i  -  xt,  Ayj  =  yj+l  -  yj 

i  j 

where  we  sum  over  all  rectangles  which  are  strictly  contained  in  Q .  If  this  sum  tends 
to  a  limit  as  we  take  finer  and  finer  partitions  we  say  that  /  is  integrable  over  F2  and 
denote  the  limit  by 

f(x,  y)  dxdy. 

We  call  this  the  integral  (or  double  integral )  of  /  over  Q .  If  Q  is  the  inside  of  a 
closed  curve  F  and  /  is  continuous  on  Q  it  can  be  shown  that  /  is  integrable  over  F2 . 
When  f(x,y)  =  1  for  all  (x,  y)  e  F2  the  Riemann  sum  is  the  area  of  the  rectangles 
in  the  partition  inside  F  and  on  taking  a  limit  we  obtain 

dxdy  =  Area  of  Q . 
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Fig.  9.1 


V 


y  =  h(x) 


Q 


y  =  sfa) 


Fig.  9.2 

If  /(*,  y)  >  0  then  the  volume  of  the  solid  over  £2  and  beneath  the  graph  of  /  is 


f(x,  y)dxdy. 


We  only  evaluate  double  integrals  over  rather  simple  open  sets.  An  open  set  is  said 
to  be  of  type  I  if  it  is  bounded  above  by  the  graph  of  a  continuous  function  y  =  h(x), 
bounded  below  by  the  graph  of  a  continuous  function  y  =  g(x)  and  on  the  left  and 
right  by  vertical  lines  of  finite  length  (see  Fig.  9.2). 

Take  a  fixed  interval  in  the  partition  of  the  v-axis,  say  (xi ,  Xi+ 1),  and  consider  the 
terms  in  the  Riemann  sum 


X  X  f{Xi  >  yy  •  Axi  ■  Ayj 

i  j 


which  only  involve  Axi  =  \  —  v/.  This  gives  the  sum 


(X  /(x«’  >v)  -  ^>7) 

j 


Axi. 


Taking  limits — this  can  be  justified  for  continuous  functions — we  get 
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Z/(-V'--V/)-Av/ 

j 


h(xi ) 

■»  J  f(xi,y)dy 

g(*i) 


as  we  take  finer  and  finer  partitions  of  the  y-axis.  Let 


h{x) 

H(x)=  f  f(x,y)dy. 
g(x) 


Then 


b 


■  Ax,  ■  Ay  j 


F,  H  (Xi )  •  Ay, 


//(v)  dv 


hj 


a 


and  taking  the  limit  on  both  sides  we  get 


b 


b  h(x) 


f(x,y)dydx=  /  H(x)dx  = 


f{x,  y)dy 


dx 


£2 


a 


a  g(x) 


This  method  of  integration,  together  with  the  similar  method  obtained  by  reversing 
the  roles  of  x  and  y,  is  known  as  Fubini’s  theorem.  We  define  an  open  set  to  be  of 
type  II  if  it  is  bounded  on  the  left  and  right  by  the  graphs  of  continuous  functions  of 
y,  k  and  /,  which  are  defined  on  the  interval  [c,  d]  and  above  and  below  by  horizontal 
lines  of  finite  length.  For  domains  of  type  II  Fubini’s  theorem  is 


d  l(y ) 


f(x,  y)  dxdy  = 


f(x,  y)dx 


Q 


k(y) 


dy. 


If  we  are  given  an  open  set  we  recognise  it  is  of  type  I  if  each  vertical  line  cuts  the 
boundary  at  two  points  except  possibly  at  the  end  points  and  type  II  if  each  horizontal 
line  cuts  the  boundary  at  two  points  except,  perhaps,  at  the  end  points. 

Example  9. 1  Evaluate 


/ /  .  - dxdy 

JJ  716  +  y7 

over  the  set  Q  bounded  above  by  the  line  y  —  2,  below  by  the  graph  of  y  —  x 1//3 
and  on  the  left  by  the  y-axis  (Fig.  9.3).  By  inspection  the  domain  Q  is  of  type  I  and 
type  II  and  we  have  a  choice  of  method,  i.e.  we  can  integrate  first  with  respect  to 
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Fig.  9.3 


y 


o 


y 


3 


Fig.  9.4 


either  variable.  Our  choice  may  be  important  since  one  method  may  be  very  simple 
and  the  other  quite  difficult. 

We  have  to  evaluate  two  integrals  of  a  single  variable.  In  the  first  integral,  the 
inner  integral,  one  of  the  variables  takes  a  fixed  value  and  is  really  a  constant.  Thus 
we  have  to  first  evaluate  either 


dy_ 

yi6+ j7 


In  these  situations  looks  are  usually  not  deceiving  and  we  opt  for  the  simpler  looking 
integral.  So,  we  choose  to  integrate  first  with  respect  to  v.  The  limits  of  integration 
in  the  first  integral  will  influence  the  degree  of  difficulty  that  arises  in  evaluating 
the  second,  or  outer,  integral.  If  you  run  into  problems  with  the  second  integral  you 
should  consider  starting  again  using  a  different  order  of  integration.  In  our  case  we 
have  decided  to  consider 


dx 


and  now  need  to  determine  the  variation  in  v  for  fixed  y.  We  draw  a  typical  line 
through  Q  on  which  y  is  constant — i.e.  a  horizontal  line. 

We  must  now  express  the  end  points  in  terms  of  y.  Using  once  more  Fig.  9.3 
we  see  that  the  end  points  of  the  line  of  variation  of  v  are  (0,  y)  and  (x,  y)  where 
y  =  x1/3.  Hence  y3  =  x  and  we  have  the  required  variation  of  x  (Fig.  9.4). 

We  see  also  that  y  varies  from  0  to  2.  Hence 


Q 


X 


dxdy  = 


v  dx 


yi6T7 


dy 


\/l6  +  y1 


o  o 
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x2  dx 


=J 

0 

2  yj  16  +  y 2 

2 

_  f 

/ 

J 

0 

2^16  +  y1 

1 

f  dw  i 

““  14 

J  jw  l 

1 

W>/2  1 

r 


o 


dy 


dy 


,7 


14  1/2  7 

=  I((144)1/2  -  (16)1/2)  = 


Example  9.2  In  this  example  we  reverse  the  order  of  integration  and  evaluate 


V^y 


o 


(x  +  y)  dx 


dy, 


From  the  limits  of  integration  in  the  inner  integral  the  left-hand  side  of  the  domain 
of  integration,  £2,  is  bounded  by  the  line  x  —  \  and  the  right-hand  side  by  points 
satisfying  v  =  ^/4  —  y,  i.e.  x2  =  4  —  y  or  y  =  4  —  x2.  Hence  the  right-hand  side 
is  bounded  by  the  graph  of  y  =  4  —  x2  and  we  have  the  following  diagram  for  our 
domain  (Fig.  9.5). 

Reversing  the  order  of  integration  we  get 


4—x ' 


(x  +  y)  dy 


0 


dx  = 


vy  + 


i) 


A—xd 


0 


dx 


2  (4  —  x2)2\  241 

x(4  —  x  )  H - - - I  dx  =  (eventually). 


The  fundamental  theorem  of  one -variable  calculus 

b 

f(b)  -  f  (a)  =  J  f'(t)dt 

a 


(9.1) 


is  used  to  evaluate  integrals  of  certain  functions  over  intervals  from  their  boundary 
values.  In  this  theorem  integration  on  the  right-hand  side  is  over  a  directed  interval 
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Fig.  9.5 


while  on  the  left  positive  and  negative  signs  are  assigned  to  the  initial  and  final  points 
of  the  interval  respectively.  Thus  we  see  that  a  certain  coherence  has  to  be  established 
between  the  orientations  on  the  two  sides  of  (9.1).  The  fundamental  theorem  of  two- 
variable  calculus  is  known  as  Green  ’s  theorem.  To  obtain  this  result  by  an  immediate 
application  of  the  one- variable  theorem  it  is  usual  to  begin  with  an  open  subset  Q  in 
M2  which  is  of  type  I  and  type  II.  In  applying  (9.1)  it  is  necessary  to  be  careful  with 
signs  and  this  means  that  in  Green’s  theorem  the  boundary  of  Q,  r,  is  oriented  in 
an  anticlockwise  or  counterclockwise  direction. 


Theorem  9.3  (Green’s  Theorem)  Let  P  and  Q  denote  real-valued  functions  with 
continuous  first-order  partial  derivatives  on  the  open  subset  U  of  M2.  if  r  is  a 
closed  curve  directed  in  an  anticlockwise  direction  such  that  the  interior  ( inside )  Q 
of  r  is  an  open  set  of  type  I  and  type  II  and  f  U  Q  C  U  then 


J  Pdx  +  Qdy 
r 


Proof  We  show 


dQ 

dy 


dxdy 


(9.2) 


and  for  this  we  use  the  type  II  property  of  Q .  From  the  representation  in  Fig.  9.6  we 
have 


Ky) 


Ky ) 


(x,  y)dx  =  Q(x,  y) 


By  (9.1) 


Ky) 

Ky) 


Q(l(y),y )  -  Q(k(y),y) 
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Fig.  9.6 


JJ  lhcdxdy  =  / (*2(/0’)>)’)  ~  G(fe(:v),.y))d;y. 

c 

On  the  other  hand,  using  the  parametrizations  y  — >  (/(y),  y)  and  y  — >►  (&(y),  y),  we 
obtain 


Qdy  =  J  Qdy  -  J  Qdy 

r  (graph  of  /)  (graph  of  k) 

d  d 

=  f  Q(l(y),y)dy  -  J  Q(k(y),y)dy. 


This  proves 


d_Q_ 

dx 


dxdy  =  /  Qdy. 


Q 


r 


The  equality 


Q 


dP\  f 

—  -^—Jdxdy  =  J  Pdx 


r 


is  obtained  in  the  same  way  and  this  completes  the  proof.  □ 

Green’s  theorem  is  true  for  many  other  sets  Q  and  the  proof  usually  proceeds 
by  partitioning  Q  into  sets  (£?/);  and  applying  the  simple  case  above  to  each 
(Fig.  9.7).  Note  that  each  new  curve  created  in  partitioning  Q  appears  as  part  of  the 
boundary  of  two  ’s  and  each  direction  along  these  new  curves  appears  precisely 
once.  Hence,  when  we  apply  (9.2)  to  each  and  add  them  together  the  integrals 
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(a)  (b) 


Fig.  9.7 


along  the  newly  created  curves  cancel  and  we  are  left  with  a  line  integral  over  the 
original  curve  r. 

A  glance  at  Fig.  9.7a  shows  that  we  no  longer  have  an  anticlockwise  oriented 
boundary.  In  our  new  situation  a  finite  number  of  piecewise  smooth  curves  form  the 
boundary  of  £2  and  as  we  proceed  along  r  in  the  given  direction  the  open  set  Q  lies 
on  the  left-hand  side.  With  these  modifications  Green’s  theorem  is  still  true.  We  are, 
of  course,  always  assuming  that  P  and  Q  are  nice  smooth  functions. 

Example  9.4  We  wish  to  calculate 


I  =  J  (5  —  xy  —  yz)dx  +  (2xy  —  x2)dy 
r 

where  r  is  the  boundary  of  the  unit  square  [0,  1]  x  [0,  1]  in  Mr.  By  Green’s  theorem 

l  l 


/  = 


3  2  3  2  \ 

(2xy  —  x  )  —  —  (5  —  vy  —  y  ))dxdy 


dx 


0  0 
l  l 


dy 


(2  y  —  2x  +  v  +  2  y)dxdy 


2-  -  =  -. 
2  2 


o 
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The  change  of  variables  rule  for  double  integrals  can  be  considered  as  a  special  case 
of  the  same  rule  for  triple  integrals  and  this  is  discussed  in  Chap.  14. 


Exercises 


9.1  Find  ff  x2  sin2  ydxdy  where  A  =  {(.v,  v)  G  Mr,  0<x<l,0<y<  n/4}. 

u 

9.2  Evaluate 

(a)  ff  x  cos(v  +  y)dxdy  where  U  is  the  subset  of  M2  bounded  by  the  triangle 

u 

with  vertices  (0,  0),  (tv,  0)  and 

(b)  ff  (. x 2  +  y2)dxdy  where  U  =  {(x,  y)  G  M2;  x2  +  y2  <  2 y}. 
u 

2 

(c)  ff  \dxdy  where  U  is  the  region  bounded  by  y  =  x,  y  =  2  and  xy  =  l. 

u  x 

9.3  Find  the  area  bounded  by  the  curves  x  =  y2  and  x  =4 y  —  y2. 

9.4  If  r  is  a  closed  anticlockwise  directed  curve  in  M2  with  interior  Q  and  F  is  a 
smooth  vector  field  on  an  open  set  containing  Q  Uf  show  that  Green’s  theorem 
is  equivalent  to 

r  r  div(F)  =  /  (F  •  n)ds 


Q 


where  n  is  the  outward  normal  to  F. 

If  /  is  harmonic  on  £2,  i.e. 

a2/  a2/ 

dx 2  dy 2 


=  0 


and  continuous  on  Q  U  F  show  that  f  (V/  •  n)  ds 

r 

9.5  By  reversing  the  order  of  integration  find 

6r  2  2  l 

(a)  f  f  e-v  dy  dx 

0  x/3 

4  ( Vy 

(b)  f  f  ty^xdx  dy. 

0  [y/2 


=  0. 


9.6  Find  the  volume  of 


{(*,  y,  z)  e  M3;  X2  +  y2  <  1,  x  >  0,  y  >  0,  0  <  z  <  1  -  xy] . 


9.7  If  Q  is  an  open  set  enclosed  by  the  anticlockwise  directed  curve  F  show,  using 
Green’s  theorem,  that 
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Area  (i2)  =  /  xdy  =  /  —ydx  =  —  /  xdy  —  ydx 


r 


r 


r 


Using  one  of  these  line  integrals  find  the  area  of  the  interior  of  the  ellipse  (x /4)2  + 
(y/5)2  =  l. 

9.8  Verify  Green’s  theorem  for  the  following  integrals: 

(a)  f  xy2dx  +  2 x2ydy,  r  is  the  ellipse  4x2  +  9y2  =  36 
r 

(b)  f  (x2  +  2 y3)dy,  r  is  the  circle  (x  —  2)2  +  y2  =  4 


r 


(c)  f  2x2y2dx  —  3yxdy,  r  is  the  square  bounded  by  the  lines  x  =  3,  x  =  5, 
r 

y=hy=4 

where  r  is  always  directed  in  an  anticlockwise  direction. 


Chapter  10 

Parametrized  Surfaces  in  R3 


Summary  We  discuss  theoretical  and  practical  approaches  to  parametrizing  a  sur¬ 
face  in  M3 . 

In  this  chapter  we  begin  a  systematic  study  of  surfaces  in  M3,  a  topic  which  will 
occupy  the  remaining  chapters  of  this  book.  We  begin  with  an  informal  discussion 
of  the  background  we  bring  to  this  investigation  and  reveal  our  general  intentions. 

Daily  we  encounter  most  of  the  classical  Euclidean  surfaces  in  M3  such  as  a 
sphere  (football,  globe),  cone  (ice-cream  cone),  ellipsoid  (egg,  American  foot¬ 
ball),  cylinder  (jar,  can),  plane  (floor,  wall,  ceiling)  and  the  non-Euclidean  torus 
(doughnut,  tube).  Many  of  our  examples  involve  these  surfaces  and  a  solid  geo¬ 
metric  understanding  of  these  should  be  deliberately  cultivated.  Initially  these 
surfaces  are  merely  subsets  of  M3  which  have  a  certain  recognisable  shape.  We 
have  already  established  mathematical  contact  with  them  as  level  sets  and  graphs 
of  functions.  Our  studies  have  been  confined  to  the  classical  problem  in 
differential  calculus  of  finding  the  maximum  and  minimum  of  a  scalar- valued  func¬ 
tion  over  a  surface  and  for  this  we  introduced  the  geometric  concepts  of  tan¬ 
gent  plane  and  normal  line.  This  essentially  summarises  the  formal  knowledge 
we  have  acquired  but  we  do  have  at  our  disposal  a  range  of  mathematical  ideas 
and  techniques  which,  if  not  directly  applicable,  will  often  hint  at  the  way  for¬ 
ward. 

Our  plan  is  to  develop  integration  theory  on  surfaces  and  to  investigate  the  geom¬ 
etry  of  surfaces.  Keep  in  mind  the  following  topics  as  particularly  relevant  to  the 
theories  we  develop: 

(a)  integration  over  open  subsets  of  M2  (Chap.  9) 

(b)  integration  along  directed  curves  in  M2  and  M3  (Chap.  6) 

(c)  the  geometry  of  curves  in  Mr  and  M3  (Chaps.  7  and  8). 

For  instance  an  open  subset  U  in  M2  can  be  considered,  or  identified  with,  a  surface 
U  in  M3  by  means  of  the  mapping 

(x,  y)  e  U - >  (x,  y,  0)  e  U. 


S.  Dineen,  Multivariate  Calculus  and  Geometry, 

Springer  Undergraduate  Mathematics  Series,  DOI:  1 0. 1 007/978- 1-4471 -64 19-7_  10, 
©  Springer- Verlag  London  2014 


103 


104 


10  Parametrized  Surfaces  in  M3 


Fig.  10.1 

Thus  any  theory  of  integration  over  a  surface  will  extend  integration  theory  over 
open  subsets  of  Mr.  We  can  say  more  since  parametrizations  (Definition  10.2)  are  a 
means  of  identifying  a  simple  surface  with  an  open  subset  of  M2  and  this  eventually 
allows  us  to  reduce  integration  over  a  surface  to  integration  over  an  open  subset  of 
M2  in  much  the  same  way  that  we  reduced  line  integrals  in  W1  to  integration  over 
an  interval  in  R  (the  key  word  in  this  sentence  is  unfortunately  “eventually”).  Thus 
our  theory  of  integration  over  surfaces  is  obtained  by  combining  and  developing 
techniques  already  used  in  (a)  and  (b).  We  shall  see  later  how  (b)  and  (c)  and,  in 
particular,  the  methods  used  to  derive  the  Frenet-Serret  equations,  can  be  extended 
to  investigate  the  geometry  of  surfaces  in  M3 . 

We  begin  our  formal  study  of  surfaces  by  defining  the  concept  of  a  parametrized 
surface  in  M3 .  Our  definition  is  mathematically  simple,  a  good  starting  point,  and 
carries  us  a  long  way  but  does  have  certain  inadequacies  that  we  discuss  as  we 
proceed. 

Definition  10.1  A  parametrized  surface  in  M3  consists  of  a  pair  (S,  0)  where  S  is 
a  subset  of  M3  and  0  is  a  bijective  mapping  from  an  open  subset  of  Mr  onto  S  such 
that  the  following  conditions  hold: 

(i)  0  has  derivatives  of  all  orders  (we  say  that  0  is  smooth  or  <^°°) 

(ii)  <fix  x  0V  7^  0  at  all  points. 

Condition  (ii)  is  the  analogue  of  P!  ^  0  for  a  parametrized  curve  in  M3  and  is 
equivalent  to  the  requirement  that  0X  and  0V  are  linearly  independent  vectors  at  all 
points  (see  Fig.  10.1). 

Definition  10.2  A  simple  surface  in  M3  is  a  subset  S  of  M3  for  which  there  exists  a 
mapping  0  such  that  (S,  0)  is  a  parametrized  surface.  We  call  0  a  parametrization  of 
5. 


One  notable  difference  between  the  above  definitions  and  the  corresponding  def¬ 
inition  of  directed  curve  (Definition  5.1)  is  the  change  from  a  closed  interval  in  R 
to  an  open  subset  of  M2  for  the  domain  of  parametrization.  For  a  directed  curve  the 
inclusion  of  end  points  in  the  domain  of  definition  leads  to  a  sense  of  direction  along 
the  curve  and  we  will  need  an  analogous  concept,  a  sense  of  orientation ,  to  develop 
vector- valued  integration  theory  over  a  surface.  However,  a  sense  of  direction  along  a 
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curve  can  also  be  obtained  by  using  tangent  vectors  at  interior  points  of  the  curve  and 
we  develop  the  concept  we  require  for  surfaces  by  using  the  interior  of  the  surface. 

Unfortunately,  many  of  the  classical  Euclidean  surfaces,  e.g.  the  sphere,  are  not 
simple  surfaces  but,  fortunately,  for  many  practical  purposes,  e.g.  the  calculation  of 
surface  area  and  the  evaluation  of  surface  integrals,  they  may  be  considered  as  simple 
surfaces.  We  will  make  this  precise  later  and  also  define  a  general  surface  in  M3. 

We  now  examine  three  specific  examples — graphs ,  surfaces  of  revolution  and 
the  classical  ellipsoid.  These,  although  apparently  rather  limited,  appear  in  many 
different  contexts  and  we  allow  them  to  divert  us  to  essential  ideas  associated  with 
any  parametrized  surface. 

Example  10.3  (see  Example  7.1)  Let/:  U  C  M2  — >  M  denote  a  smooth  function 
defined  on  the  open  subset  U  of  Mr  and  let  S  denote  the  graph  of/,  i.e. 

s  =  { (x,  y,f(x,  y))  :  (x,  y)  e  U}  . 

The  form  of  S  immediately  gives  a  parametrization  </>  defined  by 

<P(x,y)  =  (x,y,f(x,y)) 

with  domain  U.  Since  <(>x  =  (1,  0 ,fx)  and  <j>y  =  (0,  1  ,fy)  we  have 


<Px  X  <py  = 


i  j  k 

1  0  fx 

0  1  fy 


As  the  final  coordinate  of  <px  x  <py  is  always  1  we  have  <px  x  <pv  ^  0.  If  <j>(x\,y{)  = 
<p(x2,  }’2  )  then 

(xi,yi,f(xi,yi))  =  (x2,y2,f(x2,y2))- 


Hence  x\  =  X2  and  y \  =  y 2  and  so  0  is  injective.  Clearly  <f>(U)  =  S. 

As  a  particular  example  consider  the  unit  sphere  S  :  x2  +  y2  +  z2  =  1 .  We  have 
z2  =  1  —  x2  —  y2  and  z  =  ±(1  —  x2  —  y2)1^2.  Hence  the  upper  hemisphere  of  S  is 
the  graph  of  the  function 

g(x,y)  =  (1  -x2  -y2)1/2 

and  the  mapping  g:  (x,  y)  (x,  y ,  (1  —  x2  —  y2) 1//2)  is  a  parametrization  of  the  upper 
hemisphere.  The  domain  of  g  is  the  disc  {(v,  y)  e  M2  :  x2  +  y2  <  1}.  To  visualise 
this  physically  imagine  the  hemisphere  as  a  dome  S  and  take  U  to  be  the  floor  of 
the  dome.  If  you  can  now  accept  yourself  as  a  point  p  on  the  floor  and  look  directly 
upwards  your  eyes  will  focus  on  a  unique  point  q  in  the  dome  S  (Fig.  10.2).  Moving 
around  you  will  be  identifying  the  points  of  U  with  points  of  the  surface.  The  floor  U 
is  the  domain  of  the  parametrization  while  the  parametrization  itself  is  the  mapping 
which  takes  p  to  q.  The  fact  that  you  are  always  looking  in  the  same  direction  means 
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z 


Fig.  10.3 

that  the  identification  or  parametrization  is  given  by  a  set  of  parallel  lines  (Fig.  10.2) 
and  since  parallel  lines  never  meet  the  mapping  is  injective  or  one-to-one.  Since  each 
point  on  the  dome  is  hit  by  an  arrow  we  also  have  an  onto  or  surjective  mapping 
and  thus  a  bijective  mapping.  In  general  any  natural  identification  of  a  flat  set  with  a 
surface  by  means  of  parallel  lines  will  lead  to  a  parametrization  (see  Example  1 1.2). 

A  point  P  in  Mr  can  be  identified  by  means  of  its  distance  r  from  the  origin  and 
the  angle  0  between  the  positive  v-axis  and  the  vector  OP  (Fig.  10.3). 

We  call  (r,  6)  the  polar  coordinates  of  P.  Polar  coordinates  are  particularly  useful 
when  dealing  with  circles  and  discs  and  with  functions  involving  the  expression 
x2  +  y 2  in  Cartesian  coordinates.  Unfortunately,  the  correspondence 

(r,  0) - >  (x,  y)  =  (r  cos  0,  r  sin  6) 

is  not  bijective  but  we  get  around  this  difficulty  by  removing  a  small  portion  of  the 
domain.  The  domain  of  the  parametrization  of  the  hemisphere  S  given  above  is  the 
disc  x2  +  y1  <  1  and  suggests  the  use  of  polar  coordinates.  Since 

(1  —  x2  —  y 2)1/2  =  (1  —  r2  cos2  6  —  r2  sin2  0)1/2  =  (1  —  r2)1/2 

the  mapping 


F :  (r,  0)  e  (0,  1)  x  (— n,  tv)  — >  (r  cos0,  r  sin0,  (1  —  r2)1/2) 


107 


10  Parametrized  Surfaces  in  M3 


Fig.  10.4 


is  a  bijective  mapping  onto  S  \  T  where  r  is  the  set 

{(-r,0,(l -r2)1/2),  0<r  <  1  j  . 


On  (0,  1)  x  (—7 r,  i r) 


Fr  x  Fo  = 


r 2  cos  0  r 2  sin  0 

(1  _  r2)l/2  ’  (J  _  r2)l/2 


is  non-zero  and  we  have  found  a  second  parametrization  of  (almost)  the  whole  hemi¬ 
sphere. 

Example  10.4  Let  P(t )  =  (x(t),  y(t)),  t  e  [a,  b ]  denote  a  directed  curve  r  in  Mr. 
We  suppose  y(t)  >  0  for  all  t.  The  surface  obtained  by  revolving  this  curve  about 
the  v-axis  is  called  the  surface  of  revolution  of  P  about  the  v-axis  (Fig.  10.4). 

Consider  a  typical  point  on  the  circle  obtained  by  rotating  the  point  (x(t),  y(t)). 
The  first  coordinate  remains  unchanged.  The  second  coordinate  will  generate  a  cir¬ 
cle  in  the  (y,  z) -plane  with  centre  (0,  0)  and  radius  y(t).  Using  the  standard  para¬ 
metrization  of  the  circle  we  see  that  a  typical  point  on  this  circle  has  coordinates 
(y(0  cos  0,  y(t)  sin  0) .  Putting  these  together  we  obtain  a  parametrization 

(t,0) - >  {x(t),  y(t)  cos  9,  y(t)  sin  0) 


where  t  e  ( a ,  b)  and  0  e  Now 


<pt  =  (x'(t),y'(t)cos0,y\t)sin0) 


and 


<fie  =  (0,  —y(t)  sin  6 ,  y it)  cos  6) . 


Hence 
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i 


4>t  x  4>e  = 


x'(t) 

0 


j  k 

y'it)  cos  9  y(f)sin0 
— y(f)sin0  y(t)  cos  0 


=  {y'(t)y(t),  -x'(t)y(t)  cos  9,  -x{t)y(t)  sin6>). 


Since 


110/  x  <j>e\\  =  ( (y'fy 2  +  (. x'fy 2  cos2  6  +  (. x')2y 2  sin2  6>)1/2 

=  y(t)  (y  (02  +  x'  {t)2y 1/2  =  y(t)  \\p'(t)  II  y  o 

we  have  <pt  x  <fio  ^  0.  Since  the  mapping  0  is  bijective  as  long  as  we  do  not  include 
0  and  0  +  In  in  the  domain  we  let  £/  =  (a,  b)  x  (0,  27t)  and  obtain  a  bijective 
mapping  onto  the  surface  of  revolution  with  one  curve,  the  original  or  profile  curve, 
removed.  Many  classical  surfaces  may  be  realised  as  surfaces  of  revolution;  e.g.  the 
cone  (Fig.  10.5a),  cylinder  (Fig.  10.5b)  and  sphere  (Fig.  10.5c). 

These  induce  the  following  parametrizations: 


Cone 

(*,  0)  - 

— >  (ht,  rt  cos  6,  rtsmO), 

Cylinder 

(f,  0)  - 

— >  (t,  r  cos  6,  r  sin0), 

Sphere 

C t ,  0)  - 

— >  (r  cos  t,  r  sin  t  cos  6,  r  sin  t  sin  6). 

We  parametrize,  in  Example  11.1  the  torus  as  a  surface  of  revolution. 
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We  now  examine  in  considerable  detail  a  particular  surface — the  standard  ellip¬ 
soid.  This  includes  the  sphere  as  a  special  case.  We  use  this  example  as  an  excuse 
to  explore  and  comment  on  many  of  the  practical  and  theoretical  considerations  that 
arise  in  studying  any  surface  and  to  shed  some  light  on  the  necessity  and  significance 
of  later  developments.  In  studying  this  example  the  reader  should  keep  in  mind  that 
parametrizations  are  nothing  more  than  coordinate  systems,  that  we  are  always  trying 
to  interpret  mathematical  facts  geometrically  and  constantly  attempting  to  articulate 
mathematically  our  geometric  observations. 

Example  10.5  We  consider  the  ellipsoid 


x1  y2 
a 2  b 2 


This  specialises  to  the  sphere  of  radius  a  centred  at  the  origin  when  a  =  b  =  c. 
Two  of  the  most  frequently  used  coordinate  systems  on  the  sphere  are  spherical 
polar  coordinates  and  geographical  coordinates.  These,  as  we  shall  see,  are  closely 
related.  Spherical  polar  coordinates  are  more  popular  in  the  mathematical  literature 
but  since  geographical  coordinates  are  in  everyone’s  normal  experience  we  devote 
more  time  to  them  here. 


The  representation  of  the  ellipsoid  as  a  level  set  of  a  sum  of  squares  suggests 
we  use  the  elementary  identity  sin2  0  +  cos2  6  =  1  to  develop  our  parametrization. 
Rewriting  the  formula  for  the  ellipsoid  as  a  sum  of  two  squares  we  get 


Now  let 


x2  y 2  z 

-7T  +  —  =  cos  0  and  -  =  sin  0, 


a ‘ 


b2 


c 


Hence  z  =  c  sin  6  and 


x2+i=cos20,  i.e.  (^)2  +  (^_V  = 

\a  cos  0  /  \bcosO / 


a 


2  b2 


=  1 


A  further  similar  substitution  gives 


x  y 

—  cos  if/  and  - =  sin  i jr. 


a  cos  6 


bcosO 


This  implies  x  =  a  cos  0  cos  t jr  and  y 
by  F,  has  the  form 


b  cos  0  sin  .  Our  parametrization,  denoted 
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Fig.  10.6 

F :  U - >  S  (ellipsoid) 

(6,  i/0 - >  ( a  cos  9  cos  x/r,  b  cos  9  sin  x/r,  c  sin  9) 

but  we  must  specify  U,  check  that  F  is  bijective  and  see  that  conditions  (i)  and  (ii) 
are  satisfied.  Geometrically  we  have  the  following  diagram  (Fig.  10.6)  in  the  case  of 
the  sphere  a  =  b  =  c. 

Thus  we  project  the  point  P  =  (x,  y,  z)  onto  the  xy-plane  to  get  the  point  Q  = 
(x,  y,  0).  The  angle  9  is  the  angle  between  the  vectors  OQ  and  OP  and  hence  9  = 
tan-1  (z  /  (. x 2  +  y2)1/2).  The  level  set  of  the  surface  through  P  parallel  to  the  xy- 
plane  is  a  circle  (remember  we  are  just  considering  a  sphere  here)  and  this  is  also 
projected  onto  the  xy-plane  and  the  angle  xf/  is  obtained  by  using  polar  coordinates 
on  this  circle.  Hence  xfs  =  tan_1(y/x).  From  Fig.  10.6  it  is  clear  that  0  ranges  over 
the  interval  (—tt/2,  tt/2)  and  x/r  over  the  interval  (0,  2tz).  Clearly  the  smoothness 
condition  (i)  is  satisfied  no  matter  what  domain  U  we  choose  for  F.  In  many  examples 
condition  (ii)  is  easily  checked  and  formal  identification  of  the  range  of  F  is  obtained 
using  a  diagram.  In  this  example,  however,  we  adopt  a  more  analytic  approach  in 
verifying  that  F  is  a  parametrization. 

We  have 

Fq  =  (—a  sin  0  cos  x/r,  —b  sin  0  sin  x/r,  c  cos  0) 


and 


F^  =  (—a  cos  0  sin  xjs,  b  cos  0  cos  xjr,  0). 


Hence, 


Fq  X  F \j/  — 


i  j  k 

—a  sin  0  cos  x/r  —b  sin  0  sin  x/s  c  cos  0 
—a  cos  6  sin  xjr  b  cos  6  cos  xfr  0 

9  9 

=  {—be  cos  9  cos  x/r,  —ac  cos  9  sin  x/r,  —ab  cos  9  sin 

cos  9  cos  x j/  cos  9  sin  xj/  sin  9 


=  —  abc  cos  9 


(: 


a 


b 


c 


) 


Ill 
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If  a  =  b  =  c  then 

|| Fq  x  Ff\\  =  a 2  cos 6 (cos2  6  cos2  0  +  cos2  6  sin2  0  +  sin2  6 )^2 

=  a2cos0.  (10.1) 

Before  proceeding  we  make  a  brief  observation  which  we  develop  in  more  detail 
in  Chaps.  16-18.  If  we  fix  0  and  let  0  vary  we  obtain  a  mapping 

0 - >  F(0,  0) 

where  6  ranges  over  an  interval  in  R.  This  defines  a  directed  curve  which  lies  in  S  with 
tangent  Fq.  Similarly  F ^  is  tangent  to  the  curve  0  — >  F(6,  0)  in  S.  These  curves 
are  called  coordinate  curves  (of  the  parametrization).  The  vectors  Fq  and  F f  lie  in 
the  tangent  space  of  S  at  P  and  are  called  tangent  vectors.  Tangent  vectors  enable  us 
to  define  directional  derivatives  of  functions  defined  on  the  surface.  If  G  :  S  — >  W1 
and  P  e  S  then  the  directional  derivative  of  G  at  P  in  the  direction  of  the  tangent 
vector  v  at  P  is  given  by 


DyG(p)  =  -^(Go0)(OUo 

dt 

where  0  is  any  differentiable  mapping  from  (—a,  a)  into  S  such  that  0(0)  =  P  and 

000)  =  v. 

Condition  (ii)  says  that  F $  and  Fq  span  the  tangent  space  and  the  following 
relationship  holds  in  any  surface  S. 


{The  tangent  space  of  S  at  P}  = 


Since  Fq  x  F^  ^  0,  the  vector  Fq  x  F ^ 
hence  parallel  to  the  normal.  In  our  case  i 


set  of  tangents  at  P  of  all  curves 
in  S  which  pass  through  P 

all  linear  combinations  spanned 
by  the  partial  derivatives  at  P 
of  any  parametrization 

set  of  directional  derivatives 
at  P  which  operate  on 
functions  defined  on  S. 

is  perpendicular  to  the  tangent  space  and 
the  surface  S  is  the  level  set  g~ 1  (0)  where 


2  2  2 

yz  z 

g(x,y,z )  =  —  +  yy  +  —  -  I- 

aL  fiz  cz 

We  now  confine  ourselves  to  the  special  case  of  the  unit  sphere,  i.e.  the  ellipsoid  with 
a  =  b  =  c  =  1.  The  adjustments  necessary,  when  this  case  has  been  considered,  in 
order  to  examine  the  general  situation  are  fairly  minor.  We  have  seen  in  Chap.  2  that 
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the  normal  is  parallel  to  Vg  and  by  inspecting  (10.1)  we  see  that  Vg(P)  is  parallel  to 
Fe(P)  x  F ( P ).  This  observation  can  be  used  to  partially  check  calculations  and,  in 
some  cases,  may  even  be  used  to  avoid  calculating  F ^  x  F^.  We  have  thus  related 
our  earlier  concepts  of  tangent  space  and  normal  with  terms  which  can  be  calculated 
from  any  parametrization. 

We  next  turn  to  find  a  domain  U  for  F  and  to  verify  that  all  requirements  necessary 
for  a  parametrization  are  satisfied  by  (F,  U).  Initially,  it  is  usually  better  to  choose 
the  domain  of  parametrization  to  be  as  large  and  as  simple  as  possible.  These  twin 
aims  may  not  always  be  compatible  with  the  requirements  for  a  parametrization  or, 
indeed,  with  one  another  so  some  judgement  is  necessary  and  this  usually  comes 
with  experience. 

By  (10.1),  xF^  7^  0  if  and  only  if  cos  0^0  and  a  natural  choice  for  the  domain 
of  0  is  the  interval  —it /2  <  6  <  n/2.  This  is  also  suggested  by  Fig.  10.6.  Since  both 
the  sine  and  cosine  functions  have  period  2tt  the  length  of  the  interval  of  definition 
of  x jr  cannot  be  greater  than  In .  The  interval  (0,  27r)  is  a  natural  choice  and  leads  to 

U  =  {(0,  x/r)  :  —n  12  <  0  <  n/2,  0  <  xfr  <  2n} 

as  our  domain  for  F.  Clearly  the  set  U  is  open.  If  F(0\ ,  xfr\)  =  F(62,  xj/ 2 )  then, 
comparing  final  coordinates  in  F,  we  get  sin^i  =  sin  62 .  Since  the  sine  function  is 
injective  on  (—n/2,  n/2)  this  implies  0\  = O2 .  Moreover,  cos  0  7^  0  on  (—n/2,  n/2) 
and  comparing  the  first  two  coordinates  of  F  we  see  that  cos  t/q  =  cos  xj/ 2  and 
sini/q  =  sin  ^2-  Since  (cos(V^),  sin(t/^)),  0  <  xfr  <  2n,  are  the  coordinates  of  a 
unique  point  on  the  unit  circle  we  have  xjr\  =  xf/2  and  F  is  injective  on  U. 

We  have  thus  shown  that  F  is  a  parametrization  of  its  range  or  image,  F(U).  By 
our  construction  the  image  of  F  lies  in  the  ellipsoid.  Does  it  cover  the  full  ellipsoid?  If 
F(0,  xfr)  =  (0,  0,  c)  then  sin#  =  1.  However,  —n/2  <  0  <  n/2,  and  so  (0,  0,  c)  £ 
F(U)  and  F  does  not  cover  the  ellipsoid.  To  find  the  range  of  F  we  proceed  as 
follows.  The  set  U  is  an  open  rectangle  in  Mr  and  its  boundary  is  the  perimeter  of 
this  rectangle.  The  function  F  has  a  nice  smooth  extension  from  U  to  its  boundary 
(Fig.  10.7)  and  the  image  of  this  boundary  is  the  boundary  of  the  image  of  F  in  the 
surface  we  are  examining. 


Fig.  10.7 
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Fig.  10.8 


Fig.  10.9 

Since  F(0,O)  =  F(0,  2n)  =  (— acosO,  0,  csinO),  F(n/2,  \/f)  =  (0,  0,  c), 
F(—n/2,  \jr)  =  (0,  0,  — c),  and  —n /2  <  0  <  7t/2  the  boundary  of  U  is  mapped  onto 
one  half  of  the  ellipse  (. x/a )2  +  (z/c)2  =  1 ,  y  =  0.  Hence  F  is  a  parametrization  of  the 
ellipsoid  with  one  curve  removed,  the  half  ellipse  PQR  in  Fig.  10.8.  As  we  develop 
geometric  insight  we  will  find  that  a  good  sketch  often  leads  to  a  rapid  identification 
of  the  domain  of  definition  of  the  parametrization  and  it  will  not  be  necessary  to  go 
through  the  protracted  investigation  we  have  just  completed. 

A  different  choice  of  domain  would  have  led  to  a  parametrization  which  almost 
certainly  would  have  covered  a  different  part  of  the  ellipsoid  and  perhaps  covered 
the  curve  that  we  missed.  However,  no  parametrization  will  cover  the  full  ellipsoid. 
This,  although  intuitively  clear,  is  a  highly  non- trivial  mathematical  result. 

A  parametrization  identifies,  with  F  as  the  mode  of  identification,  a  flat  set  of 
points,  the  open  subset  U  of  M2,  with  a  subset  of  the  surface,  F(U),  in  a  one-to-one 
fashion.  So  essentially  we  are  taking  a  sheet  of  paper  U  and  using  F  to  wrap  it  around 
a  sphere.  The  first  stage  of  wrapping  turns  the  sheet  of  paper  into  a  cylinder  with 
the  sphere  inside  it  (Fig.  10.9).  This  identifies  the  boundary  lines  I  and  II  with  one 
another  (Fig.  10.7)  and  is  equivalent  mathematically  to  F(6,  0)  =  F(6,  In)  for  all  0. 

The  final  steps  in  the  wrapping  collapses  III  onto  (0,  0,  c)  and  IV  onto  (0,  0,  —  c). 
Mathematically  this  says  F(n/2,  x/s)  =  (0,  0,  c)  and  F(—n/2,  x/r)  =  (0,  0,  —  c)  for 
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Fig.  10.10 

all  xj/ .  We  thus  see  geometrically  that  if  we  include  the  boundary  of  U  in  the  domain 
of  F  then  the  curve,  which  we  previously  missed,  is  covered  twice  and  F  is  not 
one-to-one.  If  we  do  not  include  the  boundary  then  this  curve  is  not  in  the  image  of 
F. 

Next  we  discuss  parametrizations  as  coordinate  systems.  A  useful  initial  approach 
to  any  such  system  is  to  sketch  and  examine  the  coordinate  curves  on  the  surface.  In 
our  case,  since  we  are  still  looking  at  the  unit  sphere,  we  consider  the  curves  obtained 
by  fixing  one  of  the  variables  of  the  function 

F(6,  x/r)  =  (cos  0  cos  if/,  cos  6  sin  x/r,  sin0),  —it /2  <  6  <  7t/2,  0  <  xfr  <  2tx. 

Since  F  is  bijective,  a  point  on  the  surface  corresponds  to  a  unique  pair  (6 1,  xjr\) 
and  F(0\,  Vq)  is  the  point  of  intersection  of  the  coordinate  curves  0  — >►  F(6,  Vr), 
x/r  — >  F(0 1 ,  x/r).  For  this  reason,  the  pair  (6,  x/r)  is  often  referred  to  as  the  curvilinear 
coordinates  of  the  point  F  (6,  x/f).  Fixing  0  is  clearly  equivalent  to  taking  a  fixed  value 
of  z  and  amounts  to  taking  a  cross-section  of  the  unit  sphere  parallel  to  the  xy-plane. 
Geometrically  we  get  a  circle  (Fig.  10.10). 

This  may  also  be  seen  analytically  since  the  mapping  F(6,  xjr)  can  be  written  as 

(0,  0,  sin0)  +  cos  0 (cos  x/r,  sin  x/r,  0). 

As  xj/  varies  over  (0,  27r)  we  get  a  circle  (with  one  point  removed)  in  a  plane  parallel 
to  the  xy-plane,  with  centre  (0,  0,  sin  0)  on  the  z-axis,  and  radius  cos  0.  A  number  of 
level  sets  of  0  are  given  in  Fig.  10.11. 

Now  fix  xjr  and  let  0  range  over  the  interval  —tt/2  <  0  <  jt/2.  We  have 

F(0,  x//)  =  cos  6 (cos  xfr,  sin  x/r,  0)  +  sin 0(0,  0,  1). 

Since  (cos  x/r,  sin  x/r,  0)  and  (0,0,1)  are  perpendicular  unit  vectors  we  get,  by 
Pythagoras’  theorem,  a  semicircle  of  radius  1.  We  do  not  get  the  full  circle  because 
of  the  restricted  range  of  0.  This  is  easily  seen,  either  geometrically  or  from 
the  above  formula,  to  be  a  semicircle  running  from  (—1,0,  0)  to  (+1,  0,  0),  i.e. 
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Fig.  10.11 


Fig.  10.12 


from  the  bottom  to  the  top  of  the  sphere.  In  Fig.  10.12  we  sketch  a  number  of  these 
level  sets  of  xj/ . 

One  can  find  a  ready-made  example  of  this  by  examining  a  globe  representing 
the  Earth.  The  level  sets  of  0  are  the  lines  or  parallels  of  latitude  with  the  equator 
corresponding  to  0  =  0.  Instead  of  +  and  —  the  terms  North  and  South  are  used.  The 
level  sets  of  x/r  are  called  lines  of  longitude  or  meridians  with  the  level  set  xj/  =  0 
going  through  Greenwich,  England.  In  place  of  +  and  —  we  use  East  and  West  (of 
Greenwich).  The  range  here  is  from  0  to  180°  while  for  0  we  use  0  to  90°.  If  we 
represent  the  Earth  on  a  map  we  are  unwrapping  the  sphere,  or  mathematically,  taking 
the  inverse  of  a  parametrization.  The  most  common  map  (Fig.  10.13)  is  obtained  by 
using  Mercator’s  projection.  This  first  unwraps  the  sphere  onto  a  cylinder  in  such  a 
way  that  when  the  cylinder  is  unrolled  shapes  are  preserved  (but  area  is  distorted — see 
Exercise  1 1.8).  What  happens  if  we  replace  the  cylinder  by  a  cone? 

Note  that  East  meets  West  on  the  International  Date  Line  in  the  middle  of  the 
Pacific  Ocean.  There  is  a  certain  ambiguity  about  this  line — is  it  180°  East  or  180° 
West  of  Greenwich?  This,  once  more,  reflects  the  fact  that  we  do  not  obtain  a  bijective 
mapping  if  we  include  the  boundary  in  the  domain  of  definition  of  the  parametrization. 
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Fig.  10.13 


Different  maps  result  from  using  different  parametrizations  or  coordinate  systems. 
The  mapping  F  that  we  have  been  discussing  is  called  the  geographical  coordinate 
system.  It  is  as  useful  and  as  intuitive  a  system  as  the  more  popular  spherical  polar 
coordinate  system  that  we  now  discuss. 

A  small  change  in  our  method  of  introducing  geographical  coordinates  leads  to 
the  definition  of  spherical  polar  coordinates.  Recall  that,  for  the  ellipsoid,  we  have 


Now  we  interchange  the  role  of  sine  and  cosine  and  let 


-  =  cos  6  and  ( 
c  V 


x2  y 2  \  1/2 

—  +  =  sin  9 

az  bz  / 


This  implies 


(— )2  +  (— )2  =  i- 

\as,mO/  \bsm6/ 


Let  x/a  sin  6  =  cos  x/r  and  y/b  sin  0  =  sin  x/r.  This  defines  the  mapping 


G(0,  xfr)  =  ( a  sin  0  cos  x Jr,  b  sin  0  sin  x/r,  c  cos  0) 

which  are  the  spherical  polar  coordinates  of  the  point  (x,y,z)- 

From  Fig.  10.14  we  see  that  0  <  0  <  it  and  0  <  x/r  <  2tz  .  The  angle  0  is  called 
the  colatitude  and  xfr  is  sometimes  called  the  azimuth.  Note  that  we  obtain  the  same 
two  sets  of  coordinate  curves  for  geographical  and  spherical  polar  coordinates — the 
labelling  is,  however,  different. 

For  a  sphere  of  radius  r  with  centre  at  the  origin  this  gives 


G(0,  xfr)  =  (r  sin#  cos  x/r,  rsinO  sin  xfr,  rcosO). 
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Fig.  10.14 


(a) 


Fig.  10.15 


y 


Note  that 

Go  x  G f  =  r  sin  6  (sin  6  cos  x/r,  sin  0  sin  x/r,  cos  6) 
and 

||  Gq  x  G f  ||  =  r 2  sin0. 

If  we  wish  to  parametrize  a  part  of  the  sphere  we  can  still  use  G — and  hence  all  cal¬ 
culations  involving  G  and  its  derivatives — but  it  is  necessary  to  identify  the  restricted 
domain  on  which  we  are  working.  Fig.  10.14  is  useful  for  this  purpose.  Suppose,  for 
instance,  we  wish  to  parametrize  that  portion  of  the  sphere  of  radius  5  with  centre  at 
the  origin  which  lies  between  the  planes  z  —  —3  and  z  =  4. 

Figure  10.15a  and  b  gives  the  minimum  and  maximum  values  of  6,  0\  and  62 ,  and 
there  is  no  restriction  on  x)r .  Hence  our  parametrization  G  has  domain 

3  4 

U  =  {(0,  x j/)  :  sin_1(-)  <  0  <  n  —  sin_1(-),  0  <  x/r  <  2n}. 

From  Fig.  10.14  we  get  the  spherical  polar  coordinates  in  terms  of  the  Cartesian 
coordinates  (x,y,z): 
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Fig.  10.16 


/  2  ,  2  ,  2x1/2 

=  (x  +y  +  z  )  7  ■ 


0  =  tan' 


i^(x2  +y2)1/2^ 


and 


xf  =  tan 


The  parametrization  of  the  sphere  can  also  be  used  to  parametrize  the  inverted  vertical 
cone  of  height  a  and  base  radius  a  (Fig.  10.16a). 

Note  that  the  angle  between  the  z-axis  and  the  curved  surface  of  the  cone  is  7t/4. 
If  we  take  the  cross-section  of  the  cone  of  height  r,  0  <  r  <  a,  then  we  obtain  a 
circle  of  colatitude  7r/4  on  the  sphere  of  radius  ry/2  (Fig.  10.16b).  The  spherical 
polar  coordinates  of  this  circle  are 


(r\fl  sin  —  cos  xf,  r\! 2  sin  —  sin  xj/,  r\fl  cos  — ) , 

=  (r  cos  xjs,  r  sin  xjr,  r) 


i x 


71 


0  <  x/s  <  2tt, 


and  we  have  the  parametrization 

(r,  xfr) - >  (r  cos  x/s,  r  sin  xfr,  r),  0  <  x/r  <  2n,  0  <  r  <  a. 


This  completes  our  analysis  of  the  ellipsoid  and  sphere.  We  remark  that  many  of  our 
comments  apply  to  a  wide  variety  of  surfaces. 

A  simple  surface  may  have  different  parametrizations  and  depending  on  the  cir¬ 
cumstances  one  may  be  more  useful  than  another.  In  Examples  10.3  and  10.4  we 
derived  other  parametrizations  for  the  “sphere”.  An  important  technique  that  we 
discuss  in  Chap.  14  is  how  to  go  from  one  parametrization  to  another.  Since  para¬ 
metrizations  are  coordinate  systems  involving  variables  this  process  is  known  as  a 
change  of  variable. 

In  the  above  discussion  we  have  seen  that  ellipsoids  and  spheres  are  not  simple 
surfaces,  although  they  may  be  written  as  a  union  of  a  simple  surface  and  a  para¬ 
metrized  curve  and  for  practical  purposes,  such  as  integration,  they  may  be  regarded 
as  simple  surfaces.  Nevertheless,  it  is  desirable  to  have  a  definition  of  surface  in  M3 
which  implies,  for  instance,  that  a  sphere  is  a  surface.  Such  a  definition,  as  we  have 
just  noted,  requires  more  than  one  parametrization. 
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Definition  10.6  A  subset  of  M3  is  a  surface  if  it  can  be  covered  by  a  collection  of 
(generally  overlapping)  simple  surfaces. 

With  the  aid  of  the  following  important  and  highly  non-trivial  result  we  outline 
a  relationship  between  surfaces,  simple  surfaces,  level  sets  and  graphs.  This  also 
shows  that  an  ellipsoid  is  a  surface. 

Theorem  10.7  (Inverse  Mapping  Theorem.)  If  F  :  U(open) C  M77  — >  M77  is  con¬ 
tinuously  differentiable  and  F'(p )  is  invertible  at  p  e  U  then  there  exist  V  and  W 
open  in  W\p  e  V ,  such  that  the  restriction  of  F  to  V  defines  a  bijective  mapping 
from  V  onto  W  with  continuously  differentiable  inverse. 

Theorem  10.8  If  S  is  a  subset  o/R 3  then  the  following  are  equivalent: 

1 .  S  is  a  surface, 

2.  for  each  P  e  S  there  exists  s  >  0  such  that  S  H  {X  :  \\X  —  P||  <  £}  is  a  simple 
surface, 

3.  for  each  P  e  S  there  exists  s  >  0  such  that  S  Pi  {X  :  \\X  —  P||  <  £}  is  the  graph 
of  a  smooth  function, 

4.  for  each  P  e  S  there  exists  s  >  0  and  a  smooth  function  f  :  {X  :  \\X  —  P||  < 
s}  — >  R  such  that  Vf(X)  7^  0  for  \\X  —  PH  <  s  and 

S  n  {X  :  ||X  -  P||  <  £}  =  {X  :  ||X  -  P||  <  e,f(X)  =  0}. 


Proof  The  first  two  conditions  are  equivalent  by  Definition  10.6.  By  Example  1.7, 
graphs  are  level  sets  and,  by  the  Implicit  Function  Theorem  (Theorem  2.1),  level  sets 
are  locally  graphs  and  the  final  two  conditions  are  also  equivalent.  By  Example  10.3 
every  graph  is  a  simple  surface  and  thus  it  suffices  to  show  that  every  simple  surface 
is  locally  a  level  set. 

Suppose  0  :  I/(open  in  M2)  — >  S  is  a  parameterized  surface.  Define  F  :  U  x 
R  — >  R3  by  letting 

F(x,  y,  z )  =  0(x,  y)  +  zG(x ,  y) 

where  G(x,  y,  z)  =  y)  x  y).  Then  F  is  continuously  differentiable  and 

F'(x,  y,  0)  =  [fix(x,  y),  </>y(x,  y),  G(x,  y)] 

is  invertible.  If  P  e  S  then,  by  the  Inverse  Mapping  Theorem,  we  can  find  open 
subsets  of  M3,  V  and  W,  such  that  P  e  W,  F(V)  =  W  and  F  restricted  to  V  has 
a  continuously  differentiable  inverse.  Let  g  =  713  o  F~l  where  713 (v,  y,  z)  =  z.  If 
(x,y,  z)  6  W,  then  for  a  unique  (x',  y z')  E  V  we  have 

(x,  y,  z)  =  Fix' ,  y  ,  z!)  =  fix' ,  y ')  +  G(x' ,  y)z! . 


This  implies 
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g(x,y,z)  =  o  F~l(x,y,z)  =  0  <^=>-  7x3  o  F~l  o  Fix'  ,y',z)  =  z  =  0 

(x,  y,  z)  =  F{xf , /,  0)  =  cp(x',  y') 
(x,  y,z)  e  S 


and  hence  g  1(0)fl]y  =  Since  Vg(X)  ^  0  for  all  X  e  W  this  completes  the 

proof.  □ 


The  above  shows  that  graphs  are  much  more  than  simple  examples.  This  will 
become  very  evident  when  we  examine  curvature  in  the  final  three  chapters. 

Exercises 


10.1  By  using  cosh2  —  sinh2  =  1  and  cos2  +  sin2  =  1  parametrize  the  surfaces 


(a) 


(b) 


xA 


+ 


y 


a * 
r 


aA 


b 2 
2  2 

37  Z  =  1 
b2  c 2 


10.2  Show  that  the  following  are  parametrizations  of  simple  surfaces 


(a)  P\(r,  0)  =  (rcosO,  r  sin 0,0),  0  <  0  <  In,  0  <  r  <  1 

(b)  Pzix,  0)  =  {xcosO,  ysin0,v  +  y),  0<v<  1,  O<0  <27r,y^0 

(c)  P3(u,  v)  =  (u  +  v,  u  —  v,  uv),  0  <  u  <  1,  0<v<  1. 


10.3  By  using  a  parametrization  of  an  ellipsoid  parametrize  an  inverted  cone  of 
height  h  and  base  radius  r. 

10.4  Use  geographical  and  spherical  polar  coordinates  to  parametrize  that  portion  of 
the  sphere  x2  +  y2  +  z2  =  r 2  which  lies  in  the  first  octant  {(x,y,z)  :  x  >  0,y  > 
0,  z  >  0}. 

10.5  LetS  =  {(x,y,z)  :  x2-\-y2-\-z2  =  1}  denote  the  unit  sphere  in  M3.  If  (u,  v)  e  M2 
then  the  line  determined  by  (u,v,  0)  and  (0,  0,  1)  intersects  S  in  a  point  other 
than  (0,  0,  1).  Let  <p(u,v)  denote  this  point.  Find  the  coordinates  of  <p(u ,  v) 
in  terms  of  u  and  v.  Show  that  0(M2)  is  a  simple  surface  parametrized  by  0. 
Identify  0(M2). 

10.6  Find  the  length  of  the  portion  of  the  curve  u  =  v  lying  in  the  surface  parametrized 
by 

0  (u,  v)  =  (u  cos  v,  u  sin  v,  u a/3) 

which  lies  between  the  planes  z  =  0  and  z  =  2  V3. 

10.7  Let f(x,  y,z)  =  x + vy  +yz  and  let  S  =  /_1  (1).  Find  two  parametrized  surfaces 
which  lie  in  S  and  cover  S. 

10.8  Consider  a  line  L  initially  lying  along  the  positive  y-axis  and  attached  orthogo¬ 
nally  to  the  z-axis.  If  L  moves  along  the  z-axis  with  constant  speed  a  and  at  the 
same  time  rotates  about  the  z-axis  with  constant  speed  b  show  that  it  sweeps 
out  a  surface  parametrized  by 


0(m,  t)  =  (m sin bt,  ucosbt,  at),  u  >  0,  t  >  0. 


Chapter  11 

Surface  Area 


Summary  We  define  and  calculate  surface  area. 

We  follow  the  method  used  in  Chap.  5  to  calculate  the  length  of  a  curve  in  order 
to  define  the  (surface)  area  of  a  simple  surface  S  in  M3 .  Let  0 :  U  — >  S  denote 
a  parametrization  of  S  where  U  is  an  open  subset  of  Mr.  We  take  a  rectangular 
partition  of  U  (Fig.  11.1),  find  the  approximate  area  of  the  image  of  each  rectangle 
in  the  partition,  form  a  Riemann  sum  and  obtain  the  surface  area  as  the  limit  of  the 
Riemann  sums. 

If  R  denotes  a  typical  rectangle  in  U  (Fig.  11.2)  then 

fix  +  Ax,  y )  -  0(x,  y)  «  c/>x(x,  y)  Ax 
and 

0(-L  y  +  Ay)  -  0(v,  y)  «  0v(v,  y)  Ay. 

If  0  =  6(x ,  y)  is  the  rmgfe  between  <px{x,  y)  and  <fiy(x ,  y)  then  using  the  well-known 
formula  for  the  area  of  a  triangle,  ^ab  sin  C,  we  get 

Area(0(7?))  %  2  •  r\<px(x,y)\\  •  ||0y(x,  y)||  •  |  sin6>(x,  j)|AxAy 
=  || (px  x  4>y(x,y)\\AxAy. 

If  0  is  integrable  over  U,  and  this  will  be  the  case  if,  for  instance  U  is  bounded,  with 
smooth  boundary,  and  0  has  a  continuous  extension  to  U,  then 


y  Area(0 (/?//))  =  ^  110*  x  0y(x;,  >>_,•) II  Ax/A^- 

iJ 


> 


110,  x  0  V||  dxdy 


u 
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Fig.  11.2 


as  we  take  finer  and  finer  partitions  of  U.  Hence  the  surface  area  of  S ,  A(S ),  has  the 
form 


MS)  = 


II  4>x  x  (py  ||  dxdy 


u 


and  is  calculated  using  a  parametrization.  In  general,  a  surface  will  admit  many 
different  parametrizations  but,  as  we  will  see  later,  they  all  give  the  same  value 
for  surface  area.  In  this  chapter  we  are  using  the  usual  physical  idea  of  area  and 
angle.  These  non-negative  absolute  quantities  do  not  require  a  sense  of  direction  or 
orientation  on  the  surface  and  lead,  as  we  have  just  seen,  to  a  relatively  straightforward 
form  of  integration.  In  the  next  chapter  we  require  more  sophisticated  concepts  of 
area  and  angle  to  integrate  vector  fields  over  a  surface. 

We  now  obtain  another  formula  for  surface  area  which  avoids  the  cross  product. 
We  maintain  the  notation  0 :  U  ->  S  for  our  parametrization  and  introduce,  in  their 
traditional  form,  three  quantities  that  make  regular  and  important  appearances  in  the 
remaining  chapters  of  this  book.  Let 


E  (or  E(x,  y))  =  <px  ■  (px  =  \\<px\\2 
F  (or  F(x,  y))  =  (px  ■  <py 
G  (or  G(x,  y))  =  (py  ■  (py  =  ||0V 


2 
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where  8  is  the  angle  between  <fix  and  <fiy.  Hence 


cos  8  = 


4>x  ■  <P 


y 


\\M  ■  My 


and 


Mx  X  <py\\  = 


Mxf\\ 0yl|2(1  -  COS2  5) 


MxfMyWj 


1  - 


Mx  ■  <t>y ) 


ll<M2ll<V2 

UxtUyt  ~MxMy)2  =  EG 


) 


,2,1  /  m2  /  /  /  x2 


-F4 


This  gives  the  following  useful  formula  for  surface  area 


A(S)  = 


yj . EG  —  F 2  dxdy . 


Figure  11.2  shows  how  E ,  F  and  G  quantify  the  distortion  of  a  rectangle  by  the 
parametrization.  The  stretching  or  contraction  of  the  sides  is  measured  by  E  and  G 
while  F  measures  the  change  in  angle.  Thus  we  see  that  shape  is  preserved  if  E  =  G 
and  F  =  0  while  (relative)  area  is  preserved  if  EG  —  F 2  is  constant.  For  many 
important  parametrizations,  including  geographical  and  spherical  polar  coordinates, 
F  =  0.  This  implies  that  angles  between  curves  are  preserved  and,  in  particular, 
parallels  (of  latitude)  and  meridians  (of  longitude)  cross  one  another  at  right  angles. 
For  geographical  coordinates  on  a  sphere  of  radius  r,  E  =  r2  and  G  =  r2  cos2  0  and 
hence  neither  shape  nor  area  are  preserved.  On  the  Equator,  where  0  =  0,  we  have 
E  =  G  =  r2  but  as  one  moves  towards  the  North  and  South  Poles,  0  — >  ±7r/2  and 
G  — >  0  while  E  =  r2 .  Consequently,  near  the  Equator  shape  is  fairly  well  preserved 
but  as  one  moves  towards  the  polar  regions  it  becomes  more  and  more  distorted. 
Mercator’s  projection  is  a  modification  of  geographical  coordinates — notice  how  the 
lines  of  latitude  in  Fig.  10.13  are  not  equally  spaced  in  order  to  preserve  shape.  If  the 
cylinder  in  Mercator’s  construction  is  replaced  by  a  cone  we  obtain  Lambert’s  equal 
area  projection  which  preserves  (relative)  area  but  distorts  shape  and  distance. 

In  Chap.  10  we  noted  that  many  well-known  surfaces  S  could  be  written  in  the 
form  S  =  S\  U  T  where  Si  is  a  simple  surface  and  T  is  a  curve  parametrized  by  a 
smooth  function  on  a  closed  interval.  We  see  now  that  the  surface  area  of  T  is  zero 
and  from  this  conclude  that  A  (S)  =  A  (Si)  and  thus  we  may  use  a  parametrization  of 
Si  to  calculate  the  surface  area  of  S.  To  show  A(F)  =  0  enclose  T  in  a  finite  union 
of  rectangles  of  small  width  s  (Fig.  11.3). 

The  sum  of  the  lengths  of  the  rectangles  is  approximately  the  length  of  T,  /(T). 
Hence 

A(T)  ^  e  x  l(T). 
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Since  s  is  arbitrarily  small  this  implies  A( T)  =  0.  This  intuitive  “proof”  can  be 
developed  into  a  rigorous  proof  using  the  Mean  Value  Theorem  for  differentiable 
functions  of  one  variable.  A  differentiable  parametrization  of  the  curve  is  necessary , 
although  our  intuition  might  suggest  otherwise,  since  there  is  a  famous  example  of 
a  square  filling  curve  parametrized  by  a  continuous  function. 

Example  11.1  In  this  example  we  consider  the  surface  of  revolution  S  parametrized 
as  in  Example  10.4.  Let  P :  [a,  h ]  — >►  Mr  denote  a  parametrization  of  the  curve  to  be 
revolved.  Our  parametrization  0  of  S  is  defined  on  U  =  {(t,  0)  :  a  <  t  <  b,  0< 
6  <  2tx  }  and  has  the  form 


=  (x (t),y(t)  cos  6,y(t)  sintf) 

where  P(t)  =  (x(t),  y(t)).  We  have  already  shown  in  Example  10.4  that 


\\<Pt  x  ||  =y(t)  \\P\t)  || 


and  so  the  surface  area  of  S  is 


2tt  b  /  2n  \  I  b 

y(t)\\P'  (t)\\dtd0  =  IfdO |  If  y(t)\\P'(t)\\dt 

0  a  \0  /  \a 

b 

=  27 t  J y(t)\\P'(t)\\dt. 

a 


For  the  cone  of  height  /zand  (base)  radius  r,P(t)  =  ( ht ,  rt),P\t)  =  ( h ,  r),  HE7  (Oil  = 
( h 2  +  r2)1//2  and  the  curved  surface  area  is 


2  */V +  *>)■*,"  = 


2nr{h2  +  r2)1/2 


tdt  =  Ttr(h2  +  r2)1/2, 


0 


For  the  cylinder  of  radius  r  and  height  h  we  have  P(t)  =  (t,  r),  Pr(t)  =  (1,0)  and 
IIP7  (Oil  =  1.  Hence  the  curved  surface  area  of  the  cylinder  is 
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h 


2tx 


r  •  1  dt  =  2tx  rh. 


0 


For  the  sphere  of  radius  r,  P(t)  =  (r  cosf,  r  shu),  P\t)  =  (— rsiiU, 

r  cos  t)  and  \\P'(t)  ||  =  r  and  the  surface  area  of  the  sphere  of  radius  r  is 


2n  /  r  sin  t  •  rdt  =  27tr2(—  cos  t) 


Tt 

—  4nr2. 


o 


We  now  parametrize  the  torus  by  realising  it  as  a  surface  of  revolution.  We  adopt  a 
slightly  different  approach  to  that  in  Example  10.4  in  order  to  obtain  what  is  regarded 
as  the  standard  parametrization.  Place  a  circle  of  radius  r  in  the  (y,  z) -plane  with 
centre  on  the  y-axis  at  a  distance  b,  r  >  b,  from  the  origin  and  revolve  this  circle 
about  the  z-axis  (Fig.  1 1.4). 

The  coordinates  of  a  typical  point  on  the  original  curve  in  the  (x,  z) -plane  are 
(b  +  rcosO,  rsinO).  When  rotated  about  the  z-axis  the  third  coordinate  remains 
unchanged  while  the  first  two  coordinates  describe  a  circle  of  radius  b  +  r  cos  0 
about  the  origin.  This  gives  a  parametrization/  with  formula 

f(Q,  /)  =  ((/?  +  r  cos  0)  cos  /,(/?  +  r  cos  0)  sin  r  sin  0 ) 


and  domain  (0,  27r)  x  (0,  27r).  Hence 


fo  =  (— r  sin  0  cos  — r  sin  6  sin  r  cos  0) 
ff  =  (—(/?  +  r  cos0)  sin  (Z?  +  r  cos0)  cos  0 


Fig.  11.4 
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By  inspection  we  see  that  fo  and  ff  are  perpendicular  and  hence  F  =  0.  We  have 

E  =  \\fo\\2  =  r 2  sin2  6  cos2  x/r  +  r2  sin2  0  sin2  x/r  +  r2  cos2  0 
=  r 2  sin2  0(cos2  if/  +  sin2  xjr)  +  r2  cos2  0 
=  r 2  sin2  0  +  r2  cos2  0  =  r2 


and 


G  =  \\fyff  \\2  =  (b  +  r  cos  0)2  sin2  xfr  +  (b  +  r  cos  0)2  cos2  x/s 
=  (b  +  rcosO)2. 

Hence,  V EG  —  F2  =  r(b  +  r  cos  6),  and 


2tc  In 


A  (Torus)  =  j  j  VEG  -  EdOdf 


0  0 
In  2n 


r(b  +  r  cos  0)d0dx// 


0  0 

2n  \  /  2n 


dxf/ 

,0  /  \0 
=  4Tt2rb. 


r(b  +  r  cos  0)d0 


Some  geometric  insight  might  have  led  you  to  this  answer  without  any  calculations 
or  integration  (see  Example  14.5).  The  coordinates  (6,  xfr)  for  the  torus  defined  above 
are  called  toroidal  polar  coordinates. 
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Example  11.2  We  wish  to  find  the  surface  area  of  the  portion  of  the  paraboloid 
z  =  4  —  x2  —  y2  that  lies  above  the  xy-plane.  This  is  the  graph  of  the  function 
/(x,  y)  =  4  —  x2  —  y2  and  we  could  find  a  parametrization  using  the  method  in 
Example  10.3.  We  prefer,  however,  to  take  a  more  geometric  approach. 

The  standard  paraboloid  z  =  x2  4- y2  can  be  sketched  by  noting  that  cross-sections 
parallel  to  the  xy-plane  are  circles  (Fig.  11.5a).  This  surface  is  turned  upside  down 
by  taking  z  =  — x2  —  y2  and  then  moved  up  4  units  in  the  direction  of  the  z-axis  to 
give  us  the  original  surface  z  =  4  —  x2  —  y2  (Fig.  11.5b). 

The  surface  cuts  the  xy-plane  when  0  =  z  =  4  —  x2  —  y2,  i.e.  on  the  circle 
x2  T  y2  =  4.  Clearly  the  geometry  shows  that  we  can  project  the  surface  in  a 
one-to-one  fashion  onto  the  disc  x2  +  y1  <  4  and  if  we  reverse  this  we  obtain  the 
parametrization 

(x,  y)  e  {x2  +  y2  <4} - >  (x,  y,  4  —  x2  —  y2)  e  Paraboloid. 


We  could  proceed  directly  to  use  this  parametrization  to  compute  the  surface  area. 
This  would  require  a  change  of  variable  in  working  out  the  double  integral.  It  is 
just  as  easy  to  initially  use  a  more  appropriate  parametrization  which  avoids  a  later 
change  of  variable. 

We  note  that  the  domain  of  the  above  parametrization  is  a  disc  and  the  formula  for 
the  parametrization  function  involves  x2  +  y2 .  Either  of  these  on  their  own  suggest 
polar  or  spherical  coordinates.  Since,  however,  the  disc  shape  and  x2+y2  only  involve 
the  first  two  coordinates  this  inclines  us  towards  polar  coordinates  for  the  pair  (x,  y). 
We  use  the  parametrization 

/ :  (r,  0) - >  (r  cos  0,  r  sin  0,4  —  r2),  0  <  r  <  2  and  0  <  0  <  2n. 

We  have  fr  =  (cos  6,  sin0,  —  2r),f$  =  (— r  sin  6,  r  cos  0,  0).  Hence  E  =  1  +  4r2, 
G  =  r2  and  since  fr  -fo  =  0  we  have  F  =  0.  The  surface  area  is 


2  2tc 


2  2n 


f  f  =  f  [  rJTT&MO 


0  0 


0  0 


z 

=  2tt  J  r\J  1  +  4 r2dr 


0 


2n 

~8~ 


u 


l^2du,  1  +  4r2  =  u,  8 rdr  =  du 
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Example  11.3  We  wish  to  find  in  this  example  the  surface  area  of  that  portion  of  the 
sphere  v2  +  y2  +  z2  =  4 z  which  lies  outside  the  paraboloid  x2  +  y2  =  3z- 

For  this  problem  we  first  get  some  idea  of  the  geometry  and  see,  once  this  is 
reasonably  clear,  that  the  parametrization,  the  domain  of  the  parametrization  and 
even  the  integration  are  relatively  straightforward. 

We  have  two  surfaces,  a  sphere  and  a  paraboloid.  Rewriting  the  equation  for  the 
sphere  by  completing  the  square  we  get 

X2  +y2  +z2  -  4z  +  4  =  4 
x2 +y2 +  (z-2)2  =  22 

and  this  is  the  sphere  of  radius  2  with  centre  (0,  0,  2)  (Fig.  1 1.6). 

If  we  take  the  cross-section  of  the  paraboloid  corresponding  to  the  plane  z  =  c 
we  get  a  circle  of  radius  >/3 ~c  and  so  we  can  sketch  the  paraboloid  (Fig.  11.7).  The 
points  of  intersection  of  the  two  surfaces  are  identified  by  letting 


Fig.  11.6 


Fig.  11.7 
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Fig.  11.8 

x2  +  y2  +  z2  -  4z  =  0  =  x2  +  y2  -  3 z. 

This  implies  z2  —  4z  =  —3 z,  i.e.  z2  —  z  =  0  and  z  =  1  or  z  =  0.  When  z  =  0  we 
get  the  point  (0,  0,  0)  on  both  surfaces  and  when  z  =  1  we  get  the  curve  x2  -\-y2  =  3, 
i.e.  a  circle  of  radius  >/3.  This  usually  indicates  where  one  surface  crosses  over  from 
the  inside  to  the  outside  of  the  other  and  gives  us  the  working  diagram  (Fig.  11.8). 

To  find  when  the  sphere  is  outside  the  paraboloid  it  suffices  to  compare  the  points 
(x,y,  zp)  and  (x,  y,  zs)  on  the  paraboloid  and  sphere,  respectively,  which  project  onto 
the  same  point  (x,  y ,  0)  in  the  (x,  y)  -plane.  From  the  defining  equations  of  the  surfaces 
we  have 


x2  T  y2  —  4zs  ~  z2  =  3zp- 

Hence 

zs  -  z2  =  3zp  -  3 zs,  i.e.  Zj(l  -  zs)  =  3 (zp  -  zs). 

Since  ^  >  0  we  have  zp  >  if  and  only  if  1  —  ^  >  0,  i.e.  when  0  <  <  1 .  Thus  the 

part  of  the  sphere  which  lies  outside  the  paraboloid  is  precisely  that  portion  which 
lies  between  the  parallel  planes  z  =  0  and  z  =  1 . 

We  thus  have  to  find  the  surface  area  of  the  sphere  x2  -\-y2  -\-(z  —  2) 2  =  22  between 
the  planes  z  =  0  and  z  =  1.  We  use  the  formula  for  spherical  polar  coordinates 
obtained  in  Example  10.5  translated  so  that  the  origin  is  at  the  point  (0,  0,  2),  i.e.  let 

x  =  2  sin  ^  cos  y  =  2  sin#  skn/f,  z~2  =  2cos0 

and  it  is  now  only  necessary  to  find  the  domain  of  the  parametrization.  Since  0  < 
z  <  1  we  consider  —2  <  z  —  2  <  —1,  i.e.  —2  <  2cos^  <  —  1.  This  implies  (see 
Fig.  10.14)  2zt /3  <  0  <  zt  and,  since  there  is  no  restriction  on  0  <  x/s  <  2zt. 
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Denote  this  parametrization  by/.  Then 

fo  =  (2  cos  0  cos  /,  2  cos  0  sin  /,  —2  sin  6) 


and 


Hence 


//  =  (— 2  sin  0  sin  /,  2  sin  0  cos  \js,  0) . 


E  =  4  cos2  0  cos2  0+4  cos2  0  sin2  0+4  sin2  0 
=  4(cos2  0  +  sin2  0)  cos2  0  +  4  sin2  0 
=  4  cos2  6  +  4  sin2  6  =  4 


and 


G  =  4  sin2  0  sin2  /  +  4  sin2  0  cos2  /  =  4  sin2  0. 


Since  fe  -ff  =  0,  F  =  0  and  the  surface  area  is 


27T  it 


2jt 


TV 


J  J  2-2sin OdOch/r  =  (4  J  J  sin OdO^j 


0  2tt/3 


0 


2tt/3 
7 r 


=  87r(—  COS0) 


=  8tt(1 


2tt/3 


1/2)  =  4;r. 


Example  11.4  We  calculate  the  surface  area  of  the  level  set  x2  —  2y  —  2z  =  0  which 
projects  onto  the  region  U  of  the  xy-plane  bounded  by  the  lines  x  =  2,  y  =  0  and 
y  =  4x.  On  the  surface  S  we  have  z  =  ^ x  —  y  and  the  surface  is  the  graph  of  the 

function  g(x,  y)  =  ^x2  —  y.  Using  Example  10.3  we  obtain  the  parametrization 

1  9 

0 :  (x,  y)  €  U - >  (x,  y,  -x  -  y) 


and,  moreover, 


\\<Px  X  M  =  ||(-g„  -gy,  1)11  =  \\(x,-l,  1)||  =  (x2  +  2)*/2. 
Note  that  only  part  of  the  surface  lies  above  the  xy-plane.  Our  surface  area  is 


JJ  (x2  +  2)1//2  dxdy. 


u 


1 1  Surface  Area 


131 


Fig.  11.9 


We  integrate  first  with  respect  to  y  and  note,  from  Fig.  11.9,  that  y  varies  from  0 
to  4x  and  afterwards  integrate  with  respect  to  v  which  varies  from  0  to  2.  We  have 


2  4x 


j  j  (. x 2  +  2)1/2  dxdy  =  J  j  (x2  +  2  )l^2dy 


u 


dx 


0  0 
2 

=  J  (x2  +  2) 'A 

0 


4x 

0 


z 

&=/W  + 


0 


w3/2  4 


=  2  /  ul/2du  ~  2 - =  -(v2  +  2)3/2 

J  3/2  3 

4  /  a/o  a  /o\  8 

3  V  /  3  W2 


where  w  =  x2  +  2,  du  =  2v<iv. 


Exercises 

11.1  Calculate  the  surface  area  of  the  paraboloid  z  =  x2  +  y2  which  lies  between 
the  planes  z  =  0  and  z  =  4. 

11.2  Find  the  area  of  the  surface  generated  by  revolving  about  the  v-axis  the  curves 

(a)  y  =  x3,  0  <  v  <  1, 

(b)  )'  =  v2.  0  <  v  <  1. 

11.3  Show  that  the  area  of  the  helicoid  defined  by  the  parametrization 
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P(r,  0)  =  (r  cos  0,  r  sin0,  6),  0  <  6  <  2tt ,  0  <  r  <  1 

is  7 x  (\/2  +  log(l  +  \/2)) . 

1 1.4  If  t/  is  open  in  M2  and  / :  f/  — >►  R  is  a  smooth  function  show 


Area  (graph)/))  =  //  ( 1  +  ||Vf||2)1/2fiMy 


11.5  Find  the  surface  area  of  the  paraboloid  in  Example  11.3  that  lies  inside  the 
sphere. 

11.6  Find  the  area  of  the  portion  of  the  surface  z  =  xy  which  lies  inside  the  cylinder 

9  9  9 

9T  +  =  aL . 

11.7  Use  the  parametrization  (r,  0)  — >  (r2/16,  rcos0,  r  sin^)  to  find  the  area  of 
the  part  of  y2  +  z2  =  16x  which  lies  in  the  first  octant  (x  >  0,  y  >  0,  z  >  0) 
between  the  planes  x  =  0  and  x  =  12  and  inside  the  cylinder  y 2  =  4x. 

11.8  Let  0 :  £/  — >►  5  denote  a  parametrization  of  the  simple  surface  5  in  M3.  Give 
some  intuitive  reasons  why  0  preserves 

(a)  angles  if  F  =  0 

(b)  relative  area  if  EF  —  G 2  is  constant 

(c)  shape  if  E  =  G  and  F  =  0. 

1 1 .9  Use  the  Cauchy-Schwarz  inequality  (Example  3.4)  to  show  that  EG  —  F2  >  0 
where  E,  F  and  G  are  calculated  from  any  parametrization  of  a  surface. 

11.10  Let  T  denote  a  directed  curve  in  a  surface  parametrized  by  a  mapping  0.  Show 
that  T  has  a  parametrization  of  the  form  P  :  [«,/?]  — >  Y,P(t)  =  cp(a(t), 
where  a  and  /  are  real  valued  functions.  Show  that 


b 

/( O  =  J  y£(P(f))a'(f)2  +  2F(P(/))a,(/)j8'(r)  +  G(P(r))/'(r)2  dt. 


a 


(see  Table  11.1) 
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Table  11.1  Useful  parametrizations 

1.  Sphere  x2  -hy2  -hz2  =  r2;  geographical  coordinates 

/(#,  \jz)  —  ( r  cos  #  cos  xf,  r  cos  #  sin  xf,  r  sin  #),  —n/2  <  6  <  7t/2,  0  <  xf  <  2tt 

fe  =  (— r  sin#  cos  x//,  —r  sin#  sin  xf,  r  cos#),  =  (— r  cos  #  sin  x//,  r  cos  #  cos  xf,  0) 

fe  x  —  —r2  cos  # (cos  #  cos  x//,  cos  #  sin  xf,  sin  #) 

E  =  r2,  F  =  0,  G  =  r2  cos2  6,  |[ fe  xfy\\  =  ~JEG  -  F2  =  r2  cos  6 
n  =  —(cos  #  cos  x/f,  cos  #  sin  xj/,  sin  #) 

2.  Sphere  X2  +y2  +z2  =  r2;  spherical  polar  coordinates 

/(#,  =  (r  sin  #  cos  x/r,  r  sin  #  sin  xj/,  r  cos  #),  0  <  #  <  7T,  0  <  x//  <  2tt 

fe  =  (r  cos  #  cos  xf/ ,  r  cos  #  sin  xj/,  — r  sin  #) ,  =  (— r  sin  #  sin  xj/,  r  sin  #  cos  xf/,  0) 

fe  xfy  =  r 2  sin  # (sin  #  cos  x/f,  sin  #  sin  t/'",  cos  #) 

£  =  r2,  F  =  0,  G  =  r2  sin2  9,  \ \fy  xfy\\  =  -JEG  —  F2  =  r1  sin 6 

n  =  (sin  #  cos  xj/,  sin  #  sin  x/z,  cos  #) 

3.  Cylinder  x2  +  y2  =  r2,  0  <  z  <  h;  cylindrical  polar  coordinates 

/(#,  z)  =  (rcos#,  r  sin#,  z),  0  <  #  <  2tt,  0  <  z  <  h 

fe  =  (-r  sin#,  rcos#,  0),  fz  =  (0,  0,  1) 
fe  xfz  =  ( r  cos  #,  r  sin  #,  0) 

E  =  r2,  F  =  0,  G=  1,  ||/e  x/z||  =  s/EG  -  F2  =  r 

n  =  (cos#,  sin#,  0) 


4.  Inverted  cone  xz  +yz  =  zz,  0  <  z  <  a 


f(r,  #)  =  (r  cos  #,  r  sin  #,  r),  0  <  r  <  a,  0  <  6  <2n 

fr  =  (cos #,  sin#,  1),  fe  =  (— r  sin#,  rcos#,  0) 
frxfe  =  (-rcos#,  — r sin#,  r) 

£  =  2,  F  =  0,  G  =  r2,  \\fr  Xfe  II  =  VFG  -  F2  =  V2r 
n  =  — -^(cos0,  sin  /  —1) 


(Continued) 
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Table  11.1  (Continued) 

5.  Torus — rotate  circle  of  radius  r  in  yz- plane,  centre  (b,  0),  b  >  r, 
about  z-axis;  toroidal  polar  coordinates 

f(0,  \jr)  =  (jjb r  cos  6)  cos  f,  (b  +  r  cos  0)  sin  r  sin  0) ,  0  <  0,  <  2tt 

fe  =  (— r  sin  0  cos  xjr,  —  r  sin  0  sin  f,  r  cos  6) , 
ff  =  (—(£>  +  r  cos  6)  sin  f,  (b  +  r  cos  0)  cos  0) 
fe  x  =  —r(b  +  r  cos  #)(cos  0  cos  cos  0  sin  sin  6) 

E  =  r2 ,  F  =  0,  G  =  (b  +  r  cos  #)2 ,  |[/g>  x.fy\\  =  VFG  —  F2  =  r(b  +  r  cos  0) 

n  =  —(cos  0  cos  cos  6  sin  sin  6) 

6.  Graph  of/:  f  c  l2  ^  1 

F(x,y)  =  (x,y,f(x,y)),  (x,y)  e  U 
Fx  =  (1,  0,/c),  Fy  =  (0,  l,/y),  FxxFy  =  (~fx,  -fy,  1) 

£  =  1  +/x2,  £  =fjy,  G  =  1  +//,  II  Fx  X  £y||  =  V£G  -  £2  =  /  +/2  +// 

n_  (-/„-/>,!) 

/  +/x  +// 

7.  Surface  of  revolution  of  T  parametrized  by  F(t)  =  ((x(f),  y(0)>  a  <t  <b, 
y(t )  >  0  for  all  t,  ||P'(f)||  -=f  0 

/(t,  0)  =  (jc(f),  y(t)  cos  0,  y(t)  sin  0),  a<t<b,  0<6<2jt 
ft  =  (x'(t),  y'(t)  cos  9,  y'(t)  sin  0) ,  fe  =  (0,  -y(t)  sin  0,  y(t)  cos  0) 
ft  *fe  =  (y'(t)y(t),  -x'(t)y(t)cos6,  -x'(t)y(t)  sin6») 

F  =  \\P'(t)  ||2,  F  =  0,  G  =  y(t)2,  \\ft  xfe\\  =  y/EG  -  F2  =  y(t)\\P'(t)\\ 
n  =  ]^f(/(0-  -*'(f)cos0,  —x'(t)  sin#) 


Chapter  12 

Surface  Integrals 


Summary  We  define  the  integral  of  a  vector  field  over  an  oriented  surface.  Geomet¬ 
rical  interpretations  are  discussed. 

Integrals  are  used  to  measure  quantities  such  as  length,  area,  expected  value,  etc.,  and 
as  with  all  measurements,  there  has  to  be  a  unit  of  measurement.  Our  basic  unit  of 
measurement  in  integration  theory  over  R  is  obtained  by  assigning  the  value  1  to  the 
rectangle  of  height  1  over  an  interval  of  length  1  measured  from  left  to  right.  From 
this  we  are  able  to  define  Riemann  sums  and  afterwards  the  Riemann  integral  of  a 
continuous  function  over  a  closed  interval.  The  inclusion  of  “left  to  right”  is  crucial 
for  without  it  we  would  have  an  ambiguous  definition — and  to  a  mathematician  an 
ambiguous  definition  is  no  definition.  To  emphasis  this  point  we  call  an  interval  [a ,  b] 
directed  from  left  to  right  a  positive  interval  in  R. 

Now  suppose  T  is  a  curve  in  Mr  and,  for  the  sake  of  simplicity,  we  suppose  that  T  is 
not  closed  and  that  we  are  interested  in  defining  the  integral  of  a  function  over  T.  We 
have  seen  in  Chap.  7  that  we  can  define  two  integrals  over  T  since  T  can  be  directed 
in  two  different  ways  and  we  have  to  decide,  before  doing  any  calculations,  which 
interests  us.  We  may  think  of  a  curve  as  having  two  sides — one  for  each  direction  of 
motion — like  two  escalators  side  by  side,  one  going  up  and  the  other  coming  down 
(Fig.  12.1).  We  call  one  side  positive  and  the  other  negative.  If  we  are  interested  in 
evaluating  integrals  over  {T,  A,  B]  we  call  T  directed  from  A  to  B  the  positive  side. 

We  use  a  parametrization  to  transfer  the  integral  over  {T,  A,  B]  to  an  integral 
over  an  interval  in  R  which  we  subsequently  evaluate.  However,  the  parametrization 
must  be  directed  correctly.  This  means  that  it  must  map  a  positive  interval  in  R  onto 
the  positive  side  of  T.  Surface  integrals  are  defined  in  the  same  way — just  step  up  a 
dimension. 

Consider  the  unit  sphere  x2  +  y2  +  z2  =  1  in  M3.  It  has  two  sides — an  inside  and 
an  outside.  More  generally,  the  surface  S  which  is  the  level  set  {X  e  U  :  /(X)  = 
0  and  V/(X)  7^  0}  has  ±V/(X) / ||  V/(X)||  as  unit  normals  at  each  point  and  we 
may  consider  V/ ( X ) / 1|  V/ (X)  ||  and  — V/ (X) / 1|  V/ (X)  ||  as  lying  on  different  sides. 
We  distinguish  between  the  sides  of  a  surface  by  using  normals. 
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Fig.  12.1 

Definition  12.1  An  oriented  surface  in  M3  is  a  pair  (S,  n)  where  S  is  a  surface  and 
n  is  a  continuous  mapping  from  S  into  M3  such  that  n (P)  is  a  unit  normal  to  the 
surface  S  at  P. 

We  will  also  use  S  to  denote  an  oriented  surface.  The  appearance  of  S  signals  that 
we  have  fixed  an  orientation  on  the  underlying  surface  S.  The  notation  S  is  simpler 
but  if  we  need  to  specify  the  normal  or  if  there  is  any  possibility  of  confusion  we 
write  (S,  n). 

Clearly  if  ( S ,  n)  is  an  oriented  surface,  then  ( S ,  — n)  is  also  an  oriented  surface  and 
the  continuity  requirement  in  the  choice  of  normal  implies  that  a  connected  surface 
can  have  at  most  two  orientations .  There  exist,  however,  surfaces  which  do  not 
admit  any  orientation  and  integration  theory  cannot  be  defined  over  such  surfaces. 
Fortunately,  every  simple  surface  admits  an  orientation.  If  0 :  U  C  R2  -*  5  is  a 
parametrization  of  S  then  (.S',  <px  x  <py/\\(px  x  <py  ||)  and  (.S',  —<px  x  <f>y/\\4>x  x  <py  ||) 
are  two  oriented  surfaces  associated  with  S  and  if  S  is  connected  these  are  the  only 
two  oriented  surfaces  associated  with  S.  Given  an  oriented  surface  (S,  n)  we  call  the 
side  of  S  containing  n  the  positive  side  and  the  side  which  contains  —  n  the  negative 
side.  In  the  case  of  a  simple  oriented  surface  (S,  n)  a  parametrization  0  is  said  to  be 
consistent  with  the  orientation  if 


0X  X  0y 

- —  =  n. 

110*  x  4>y\\ 


If  F  is  a  continuous  vector  field  on  a  simple  oriented  surface  S  we  define 


F 


by  using  a  parametrization  (0,  U)  consistent  with  the  orientation  and  a  process 
involving  Riemann  sums  similar  to  that  used  to  define  surface  area  in  the  previous 
chapter  (see  Figs.  11.2  and  11.3  for  reference).  We  find  it  convenient  to  use  ( u ,  v) 
for  the  variables  in  the  domain  of  0  and  (x,  y,  z)  for  the  range,  i.e.  v  =  x(u,  v), 
y  =  y(u,v)  and  z  =  z(u,v).  We  begin  by  partitioning  the  domain  U  into  small 
rectangles  and  consider  a  typical  rectangle  R  in  this  partition.  Now,  however,  instead 
of  taking  the  absolute  area  of  0  (R)  we  take  the  ( signed)  volume  of  the  parallelepiped 
with  base  0(R)  and  height  determined  by  F  (see  Fig.  12.2). 

We  use  n  as  our  positive  unit  of  measurement  perpendicular  to  the  surface.  This  is 
reasonable  since  we  are  just  considering  a  small  portion  of  the  surface  0(R),  which 
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Fig.  12.2 


Fig.  12.3 

lies  approximately  in  the  tangent  plane,  and  n  (0  (u ,  v))  is  perpendicular  to  the  tangent 
plane  at  v).  The  height  of  the  parallelepiped  is  ||F||  cos  x/r  =  F  •  n  where  if/  is 
the  angle  between  F  and  n  at  v)  (see  Fig.  12.3). 

We  have  already  seen,  in  calculating  surface  area,  that  the  area  of  <p(R),  the  base 
of  the  parallelepiped,  is  \\</>x  x  <py  ||  Ax  Ay  and  hence  the  (signed)  volume  is 

(F  •  n)||0M  x  0V||  AuAv  =  (F  •  (pu  x  (pv)AuAv. 

Now  taking  the  limit  of  Riemann  sums  in  the  usual  way  we  define 


(F  •  n)  \\<pu  x  0V || dudv 


s  u 


4>u  x  <fiv)dudv. 


From  this  analysis  we  expect  the  volume  of  the  solid  one  unit  high  on  the  positive 
side  of  the  surface  to  equal  the  total  surface  area  and  to  be  obtained  by  integrating 
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the  vector  field  n  over  S.  Indeed,  we  have 


// 


n  = 


// 


(n  •  n) || <J)U  x  <fiv\\dudv  =  Surface  area  ( S ). 


(S,n) 


u 


It  is  now  natural  to  define  the  surface  integral  of  a  scalar-valued  function  f  on  (S,  n) 
by  identifying  /  with  / n,  i.e. 


(5,n)  (S,n) 


S 


where  0  is  a  parametrization  consistent  with  the  orientation.  This  is  analogous  to  the 


way  we  defined,  in  Chap.  7,  the  integral  of  a  scalar- valued  function  /  along  a  curve 
by  identifying  /  with  fT.  Although  we  needed  an  orientation  to  define  this  integral 
it  is  easily  seen  that 


/ 


(5,11)  (5,-n) 


and  the  value  of  the  integral  is  independent  of  the  orientation.  This  might  suggest 


that  it  is  possible  to  integrate  a  scalar- valued  function  over  an  arbitrary  surface  (see 


Definition  10.6).  This  is  not  true.  What  is  required  is  an  orientable  surface ,  i.e. 
a  surface  which  can  be  oriented.  Simple  surfaces  are  orientable.  This  means  that 
we  can  use  any  parametrization  to  integrate  a  scalar- valued  function  over  a  simple 
surface. 

If  the  vector  field  F  is  represented  by  vectors  emanating  from  the  surface  and  if 
these  all  lie  on  the  same  side  (of  the  surface)  as  n  then  the  integral  of  F  over  (S,  n)  is 
non-negative.  On  the  right-hand  diagram  in  Fig.  12.2  we  see  that  {fu,  <fiv,  n}  follows 
the  right-hand  rule  and  note  that  F  and  n  lie  on  the  same  side  of  (S,  n)  if  and  only 
if  the  angle  between  them  lies  in  [— jr/2,  n/2]. 

Example  12.2  We  evaluate  ff  F  where  S  is  the  sphere  of  radius  r  with  centre  at 


(S,n) 


the  origin  oriented  outwards  and  F (x,  y,  z)  =  (x,  y,  z).  Although  a  sphere  is  not 
a  simple  surface  we  can  treat  it  as  one  for  integration  theory  (see  our  remarks  in 
the  previous  chapter).  Here,  however,  we  do  not  need  any  parametrization.  The  unit 
normals  at  a  point  P  on  S  are  ±P/\\  P  ||  and  since  S  is  oriented  outwards 


Hence 


F (P)  n (P)  =  P  • 


P 


r 


2 


=  r 


r 


r 


r 
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Fig.  12.4 


and 


9  Q 

rdA  =  r  (Surface  Area  of  S)  =  r  •  47rr  =  47rr 


Example  12.3  We  compute 


-(2) 
V2az  -  z2 


where  S  is  the  part  of  the  surface  x2  +  y2  +  (z  —  a)2  =  a 2  which  lies  inside  the 
cylinder  x2  +  y2  =  ay  and  underneath  the  plane  z  =  a  oriented  with  outward 
pointing  normal. 

We  first  sketch  the  surface  over  which  we  are  integrating.  Initially  this  appears  as 
a  rather  formidable  task  but  by  sketching  each  part  separately  (Fig.  12.4)  and  then 
combining  (Fig.  12.5)  them  it  becomes  relatively  simple.  The  surface  x2-\-y2-\-(z  — 
a)2  =  a2  is  a  sphere  of  radius  a  with  centre  (0,0 ,  a).  The  equation  x2  +  y2  =  ay 
can  be  rewritten  as 

2  2 

99  a  a 

x  +  y  -ay  +  —  =  — 

i.e. 

*2  +  (y~l)2  = 

and  this  surface  is  the  cylinder  parallel  to  the  z-axis  above  the  circle  with  centre 
(0,  a/2 )  and  radius  a/2  in  the  vy-plane.  The  plane  z  =  cl  is  parallel  to  the  vy-plane 
and  a  units  above  it.  The  outward  normal  to  the  sphere  of  radius  a  with  centre  (0,0 ,  a) 
at  (x,  y,  z)  is  (x,  y,  z  —  a) /a.  Hence 

(*,  y,  z  —  a)  _  f  f  (*,  y,z-a)  (x,  y,z-  a) ^ A 
V2  az  —  z2  J J  V2  az  —  z2  a 

S  V 

a 

- dA. 

V2  az  —  z2 
s 


140 


12  Surface  Integrals 


z 


Fig.  12.5 


From  Fig.  12.5  we  observe  that  our  integral  is  over  part  of  the  lower  right-hand 
quarter  of  the  sphere.  We  use  geographical  coordinates  about  the  point  (0,  0,  a) 

F :  0 0 ,  x/r) - >  (< a  cos  0  cos  x/r,  a  cos  0  sin  \[r,  a  +  a  sinO). 

Since  6  refers  to  latitude  and  we  are  considering  the  lower  portion  we  have 
— 7t/2  <  0  <  0.  From  the  sketch  y  >  0  and  hence  sinx/s  >  0,  i.e.  0  <  xj/  <  tv, 
and  since  the  surface  lies  inside  the  cylinder  we  also  have  x2  +  y2  <  ay,  i.e. 
a2  cos2  0  <  a2  cos 6  sin  ifr.  Hence  cos#  <  sin  xj/  =  cos(^  —  j).  By  considering 
separately  the  cases  0  <  ^  <  f  and  ^  <  xjr  <  n  we  obtain  —  0. 

Now 

2  az  —  z2  =  —a2  +  2  az  —  z2  +  a2  —  a2  —  (a  —  z)2 
and  changing  this  into  our  new  coordinates  we  get 

yjlaz  —  z 2  =  ( a 2  —  a2  sin2  6) l^2  =  ( a 2  cos2  6) 1//2  =  a  cos  0 . 


From  Example  10.5  we  recall  that 


dA  =  JEG  -  F2d9df  =  a2  cos  OdOch/s. 


Hence 


(x,y,z  -  a) 
+Jlaz  —  z2 


•o 

-7T/2 


>7r/2  6  a 2  cos  0  •  a 


it  /  2+0 


a  cos  0 


d\j/ 


dO  =  ji2  a2 1 4 


An  alternative  approach  to  vector-valued  integration  is  possible  by  using  ori¬ 
ented  (coordinate)  planes  in  M3 .  In  oriented  planes  we  can  define  positive  and  neg¬ 
ative  area  and  an  anticlockwise  sense  of  direction.  The  idea  is  to  use  consistent 
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z 


Fig.  12.6 

parameterizations  to  transfer  integration  from  an  oriented  surface  in  M3  to  integra¬ 
tion  over  subsets  of  the  coordinate  planes  in  M3,  i.e.  the  xy- plane,  the  yz- plane  and 
the  vz-plane.  Each  of  these  planes  is  a  surface  in  M3  and  to  define  our  integral  we  must 
assign  a  positive  side  to  each  of  these  planes  for  the  same  reason  that  we  required 
positive  intervals  in  R. 

We  first  define  the  positive  unit  vectors  in  the  x,  y  and  z  directions  in  M3  as 
(1,  0,  0),  (0,  1,0)  and  (0,  0,  1).  These  choices  are  natural  in  view  of  the  way  we 
sketch  graphs  in  M2  and  M3.  Consider  now  the  vy-plane  in  M3  as  the  surface  defined 
by  /-1  (0)  where  f(x,y,z)  =  z.  The  unit  normals  to  this  surface  are  ±(0,  0,  1)  and 
we  must  choose  between  them  to  define  a  positive  side  to  the  vy-plane.  We  use  the 
positive  unit  vectors  in  the  x  and  y  directions  in  that  order  and  the  right-hand  rule 
or  cross  product  to  define  the  positive  side  of  the  vy -plane  as  that  oriented  by  the 
normal 

(1,  0,  0)  x  (0,  1,  0)  =  (0,  0,  1). 

We  denote  the  positively  oriented  vy-plane  in  M3  by  M2a.  y  Similarly  the  positive 
side  of  the  yz-plane  is  defined  by  the  normal  (0,  1,0)  x  (0,0,  1)  =  (1,0,0) 
and  denoted  by  ^  and  the  positive  side  of  the  zv -plane  is  defined  by  (0,  0,  1)  x 

(1,  0,  0)  =  (0,  1,  0)  and  denoted  by  M2„  xy 

In  Fig.  12.6  we  see  the  direction  of  rotation  (i.e.  rotation  to  the  left )  used 
to  measure  angles  in  each  of  the  coordinate  planes  and  also  the  direction  of  a  closed 
anticlockwise  oriented  curve. 

If  we  look  at  each  individual  coordinate  plane  we  get,  using  Fig.  12.6,  the  diagrams 
in  Fig.  12.7  showing  the  anticlockwise  directions. 

In  Chap.  9  we  discussed  Green’s  theorem 

U{d-£  -  d-£)dxdy  <12|) 

Q 


j)  Pdx  +  Qdy  = 


where  the  boundary  of  £2,  T  is  oriented  in  an  anticlockwise  direction.  From  (12.1)  it 
does  not  appear  that  £2  has  been  assigned  any  orientation  but,  in  fact,  if  we  identify 
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(a) 

y 


X 


Fig.  12.7 


(b) 


(C) 


Fig.  12.8 


M2  with  M~x  y)  C  M3  then  the  integration  on  M2  is  really  over  the  positive  side  of 

M2y  y  This  is  the  matching  up  of  orientations  in  Green’s  theorem. 

Let  F  =  (/,  g,  h)  denote  a  continuous  vector  field  on  the  simple  oriented  surface 
S.  Let 

0:  (u,  v)  e  U - >  (x(u,  v),  y(u,  v),  z(u ,  v)) 

denote  a  parametrization  consistent  with  the  orientation.  We  now  take  an  independent 

approach  to  defining  separately  ff(f,  0,  0),  ff(0,  g,  0)  and  ff(0,  0,  h). 

s  s  s 

First  observe  that  (f,  0,0)  is  parallel  to  the  v-axis  and  may  be  represented  as  in 
Fig.  12.8,  where  we  have  rotated  the  axes  but  preserved  the  orientation. 

Since  /  points  in  the  direction  of  the  normal  (1,  0,  0)  to  M2y  ^  it  is  natural  to 

project  onto  M2y  z)  and  to  define  ff(f,  0,  0)  as  ff  f  where  Py  z  is  the 

s  fw(S)cK^j 

projection  (x,y,z)  — >  (y,z)  and  Py  z( S)  is  given  the  induced  orientation  from 
R(y,z)  .  Although  the  set  Py,z( S)  may  not  be  open  and  the  mapping 


Py,z  ■  («.  v) - >  (y(u,  v),  z(u,  v)) 
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is  not  necessarily  a  parametrization  of  Py,z( S)  we  may  proceed  as  if  they  were. 
Partition  U  into  rectangles,  (Rij)ij •  We  consider  the  “boundary”  of  Py  z(Rij )  as  a 
directed  curve  in  ^  “parameterized”  by  Py?z  restricted  to  the  boundary  of  the 
anticlockwise  oriented  rectangle  Rij  in  the  wv-plane.  Hence  Pyz(Rij)  will  have 
positive  (or  at  least  non-negative)  area  if  it  is  oriented  in  an  anticlockwise  fashion  in 
2  ^ .  This  gives  us  the  approximation 


M2 


Area (Py>z  (/?,;))  inK(y,z)  % 


yu  Zu 

yv  zv 


(ui,Vj)AuiAvj. 


Next  we  transfer  the  partition  of  U  to  Py,z( S)  by  using  Py  z  and  0  and  form 
the  Riemann  sum  of  /  with  respect  to  this  partition.  This  analysis  shows  that  the 
Riemann  sum 


i  j 


Area  (. Py,z(Rij ))  in  M^z) 


is  approximately  equal  to 


i  j 


yu 

yv 


Zu 

Zv 


(uj,Vj)AujAvj 


and  on  taking  a  limit  we  get 


//(/.  0, 0, 

s 


JJ  f(4>(u,v)) 

u 


yu 

yv 


Zu 

Zv 


dudv. 


We  proceed  in  the  same  way  with  (0,  g,  0)  and  obtain  the  diagram  shown  in 
Fig.  12.9.  This  implies  that  z  precedes  v  (see  also  Fig.  12.7c)  and  so  the  correctly 
oriented  “parametrization”  in  this  case  is 

(m,  v) - >  (z(u,  v),  x(u,  v)) 


and  we  obtain 


//(».  * .  0, 

s 


%u 

Vy 


dudv . 


Similarly  we  are  led  to 


(0,  0,  h) 


s 


u 


Xu 

Xy 


yu 

yv 


dudv . 
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Fig.  12.9 


Although  each  of  these  integrals  looks  rather  complicated  when  we  add  them  together 
we  obtain  a  familiar  expression.  Since 

<p(u,  v)  =  (x(u,  v),  y(u,  v),  z(u,  v)) 


we  have  <pu  =  (x„,  yu,zu )  and  <pv  =  (xv,  yv,  zv)-  Hence 


0M  X  0v  — 


1  J  K 

Xu  yu  Zu 

yv  zv 


(I 


yu  Zu 

yv  zv 


Xu  Zu 
Xv  Zv 


Xu  yu 
Xv  yv 


I) 


F  -<pu  X  0v  =  / 


and 


yu 

Zu 

~  g 

Xu 

Zu 

+  h 

Xu 

yu 

yv 

Zv 

Xv 

Zv 

Xv 

yv 

f 

g 

h 

/F  \ 

Xu 

yu 

Zu 

=  det  | 

Xv 

yv 

Zv 

U  J 

(/,0,0)+  JJ  (0,g,0)+  (0,0,  A) 

s  s  s 


/ 


yu  Zu 

yv  zv 


+  g 


Zu  Xu 


Zv  X 


v 


+  h 


Xu  yu 
Xv  yv  \) 


dudv 


u 


= IS mu- v,)  •  *- x 


u 


=  det  I  <pu  |  dudv. 
u  \  0V 


(12.2) 


(12.3) 


(12.4) 
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Formula  (12.3)  is  our  original  definition  of  the  integral  of  F  over  S  and  thus  (12.2) 

and  (12.4)  are  new  expressions  for  ff  F.  We  have  also  shown,  provided  we  use  the 

s 

correct  orientations  on  Pyz(S),  PZX(S)  and  Px,y( S),  that 


In  certain  cases  these  projections  map  S  onto  geometrically  nice  domains  in  the 
coordinate  planes  and  this  may  lead  to  simpler  calculations  -see  for  instance  our  next 
example,  Example  13.2  and  Exercises  12.1,  12.2  and  12.8. 

From  (12.3)  and  the  results  of  the  previous  chapter  we  also  have 


(F  •  n)  \\<pu  x  0V|| dudv 


s  u 


(F  •  n)  JeG  —  F2dudv. 


Using  the  notation 


d(x,  y ) 

d(u,  v) 


xu  yu 
*v  yv 


etc. 


(12.5) 


we  obtain  yet  another  formula  for  the  integral: 


/d(u  y) 

\d(u,v) 


d(z,x) 

d(u,v) 


dudv. 


(12.6) 


In  view  of  formula  (12.5)  the  notation  ff  (F,  n )dA  is  also  used  in  place  of  ff  F 

5  (V,n) 

where  dA  denotes  surface  area. 

We  have  established  five  formulae  for  calculating  surface  integrals.  Furthermore, 
our  excursion  into  oriented  planes  has  led  to  geometric  insights  on  the  construction  of 
the  integral  and  to  a  method  of  evaluating  integrals  by  projecting  onto  the  coordinate 
planes.  There  are  still  a  number  of  topics  to  be  sorted  out,  e.g.  independence  of  the 
parametrization,  and  we  discuss  these  in  examples  as  we  proceed. 

Example  12.4  In  this  simple  example  we  use  projections  to  calculate  the  area  of  the 
triangle  with  vertices  (1,  0,  0),  (0,  1,  0)  and  (0,  0,  1).  The  fact  that  we  can  easily  find 
the  answer  independently  allows  us  to  check  our  solution. 

From  Fig.  12.10,  P  =  (1/2,  1/2,0)  and 


||(l/2,  1/2,0) -(0,0,  1)|| 
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Fig.  12.10 

Since  ||(1,  0,  0)  -  (0,  1,  0)||  =  >/2  the  (surface)  area  is  1/2  •  >/2  •  =  V3/2. 

We  now  calculate  the  area  using  ff  n.  Since  the  triangle  is  part  of  a  plane  the 

(S,n) 

normal  n  is  constant  on  S.  By  symmetry  it  is  easily  seen  that  the  triangle  lies  in 
the  plane  v  +  y  +  z  =  1  and  hence  the  unit  normals  are  ±(l/\/3,  l/\/3,  l/\/3). 
Let  us  take  (1/V3,  l/\/3,  1/V3)  as  our  unit  normal.  By  symmetry  we  only  have 

to  calculate  f f  0,  0).  Clearly  Py  z  projects  onto  the  anticlockwise  oriented 

(S,n) 

triangle  A\  C  ^  with  vertices  (0,  0),  (0,  1)  and  (1,  0)  and 


Hence 


(S,n) 


1 

— =  Area  (A\)  = 
V3 


1  1 
2' 


(S,  n) 


1  1 

Vf'2 


V3 

2 


and  this  agrees  with  our  earlier  result. 


Exercises 

12.1  Find  ff  F  where  F(x,y,  z)  =  (1,  2,  3)  and  S  is  the  triangle  with  vertices 

s 

(1,  0,  0),  (0,  2,  0),  (0,  0,  3)  oriented  so  that  the  origin  is  on  the  negative  side. 

12.2  By  using  projections  and  the  portion  of  the  sphere  that  lies  in  the  first  octant 
calculate  the  area  of  a  sphere  of  radius  r. 

12.3  Express  as  an  integral  over  a  region  in  M2  the  integral  ff  F  where  F (x,  y,  z)  = 

s 

(— 2/y3,  —6xy2,  2 z3/x3f  S  is  the  graph  of  f(u,  v)  =  uv3,  ( u ,  v)  e  [1,  2]  x 
[1,2]  and  the  parametrization  (p(u,v)  =  (u,v,  uv3)  is  consistent  with  the  ori¬ 
entation.  Evaluate  the  integral. 
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12.4  LetO  <  a  <  b  and  let  T  denote  the  circle  of  centre  ( b ,  0)  and  radius  a  in  the 
vz -plane.  Let  S  denote  the  surface  obtained  by  rotating  T  about  the  z-axis.  If  S 
is  oriented  with  outward  normal  and 


F(x,  y,  z)  =  (x - - 

V  ■ 


bx 


xl  +  y* 


z 


) 


show  that 


F  =  4n2a2b. 


12.5  Evaluate 


(a) 


(b) 


2  ,  2 
y  +  £ 


l 


(x2  +  y2  +  (z  +  a)2)l/2 


where  S  is  the  portion  of  the  sphere  x2  +  y2  +  z2  =  a 2  above  the  vy-plane 
oriented  by  the  outward  normal. 

12.6  Describe  the  surface  S  =  {(v,  y,  z)  :  z2  =  x2  +  y2,  1  <  z  <  3}.  If  n  is  the 
outward  pointing  normal  to  S  find 


(-xz,  ~ yz,x 2  +  y2) 
x2  +  y2 


(V,n) 


12.7  Let  S  denote  the  portion  of  the  level  set  z  =  tan-1(y/v)  which  lies  between 
the  planes  z  =  0  and  z  =  2tt,  inside  the  cone  x2  -\-  y2  =  z2  and  outside  the 
cylinder  x2  +  y2  =  n2 .  Let 


G(v,  y,  z)  = 


(-x,  -y,  -z) 

O2  +  y2  +  z2)3/2  ’ 


Show  that 


F(r,  0)  =  (r  cos  0,  r  sin  0,  0),  n  <  r  <  0,  tx  <  0  <  In 


12.8 


is  a  parametrization  of  S.  If  n  = 


— r- - —  evaluate  ff  G. 

Fr  x  Fe  II  (lJn) 


Evaluate  ff  F  where  F(x,y,z)  =  (y2  +  z2,  tan  l(x/z),  z  expO2  +  y2))  and 
s 


S  is  the  part  of  the  sphere  of  radius  a  centered  at  the  origin  in  the  first  octant 
oriented  outwards  by  (a)  using  a  parametrization  and  (b)  projecting  onto  the 
coordinate  planes. 


Chapter  13 

Stokes’  Theorem 


Summary  We  discuss  Stokes’  theorem  for  oriented  surfaces  in  R3. 

Stokes’  theorem,  the  fundamental  theorem  of  calculus  for  surfaces ,  generalises 
Green’s  theorem  to  oriented  surfaces  S  =  (S,  n)  with  edge  or  boundary  T  (the 
term  edge  avoids  confusion  with  our  other  use  of  the  word  boundary)  consisting  of 
a  finite  number  of  piecewise  smooth  directed  curves.  We  suppose  that  the  positive 
side  of  S  lies  on  the  left-hand  side  as  we  move  along  T  in  the  positive  direction.  In 
practice  this  consistency  between  the  orientations  of  the  surface  and  its  edge  may  be 
verified  by  sketching.  In  certain  cases  the  normal  n  admits  a  continuous  extension  to 
the  boundary  and  a  parametrization  P  of  the  boundary  has  the  correct  orientation  if 
at  one  point,  say  P  (to),  we  have  P' (to)  •  n(P(fo))  >0  where  n^P  (to))  is  the  value  of 
the  extension  of  n  at  P (to).  If  a  consistent  parametrization  of  the  surface  extends  to 
give  a  parametrization  of  the  boundary  then  the  boundary  is  also  correctly  directed 
(see  Example  13.5). 

Theorem  13.1  (Stokes’  Theorem)  Let  S  =  (S,  n)  denote  an  oriented  surface  in  M3 
with  boundary  T  consisting  of  a  finite  number  of  piecewise  smooth  directed  curves. 
We  suppose  that  the  positive  side  of  S  lies  on  the  left  of  the  positive  side  ofT.  If  F  is 
a  smooth  vector  field  on  S  Uf  then 


JJ  curl(F) 

s 


(13.1) 


i.e. 


(F,  T)ds  =  // (curl(F),n )dA 


where  T  is  the  unit  tangent  to  the  directed  curves  T. 

The  proof,  which  we  omit,  is  obtained  by  applying  Green’s  theorem  to  the  pro¬ 
jections  onto  the  coordinate  planes.  In  Chaps.  6  and  12  we  developed  techniques 
to  evaluate  the  left-  and  right-hand  sides  of  (13.1),  respectively.  Thus  the  only  new 
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Fig.  13.1 

factor  in  Stokes’  theorem  is  the  correlation  between  the  orientations  of  the  surface 
and  its  boundary. 

Example  13.2  We  use  Stokes’  theorem  to  evaluate  the  line  integral 


—y3dx  +  x3dy  —  z3dz 


where  C  is  the  intersection  of  the  cylinder  x2  +  y2  =  3  and  the  plane  v  +  y  +  z  =  1 
and  the  orientation  on  C  is  anticlockwise  when  viewed  from  a  point  sufficiently  high 
up  on  the  z-axis.  Let  S  denote  the  portion  of  the  plane  inside  the  cylinder  oriented  so 
that  the  normal  lies  above  the  surface  (Fig.  13.1). 

Let  F  =  (— y3,  v3,  — z3)  then 


curl(F)  = 


l 

d 

dx 


j  k 

a  a 

dy  dz 

3  3 

V  —  Z 


(0,  0,  3x2  +  3y2). 


By  Stokes’  theorem 


JJ  curl(F) 

s 


(0,0,  3v2  +  3y2), 


Since  only  the  final  coordinate  of  curl(F)  is  non-zero  our  analysis  in  the  previous 
chapter  implies  that  we  only  need  consider  the  projection  of  S  onto  R2x  y  As  C 
projects  onto  an  anticlockwise  oriented  curve  V i  (Fig.  13.1)  our  projection  is  onto 
the  positive  side  of  R2x  .  Hence 


x2+y2<  3 


(3x2  +  3  y2)dxdy. 
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We  parametrize  the  surface  x2  +  y2  <  3  in  M3  by 

0(r,  0)  =  (r  cos  6 ,  r  sin0,  0),  0  <  r  <  V3,  0  <  0  <  2ti. 

We  have  <pr  =  (cos  6,  sin  6,  0)  and  <po  =  (— r  sin  6,  r  cos  0,  0).  Hence 


(pr  X  <pe  = 


i  j  k 

cos  0  sin  6  0 

r  sin  6  r  cos  6  0 


=  (0,  0,  r), 


Since 


0r  X  (pp 
Ur  X  (pe  II 


=  (0,  0,  1) 


our  parametrization  is  consistent  with  the  positive  orientation  of  R2x  y  We  have 


r2n  27tt 

F  =  /  /  3r2  •  \\(pr  x  (pe\\drd0  =  6n  r3 dr  =  - 

c  Jo  Jo  Jo  2 


Moral  A  reasonable  sketch  is  not  just  optional  but  necessary.  The  form  of  F,  i.e. 
the  fact  that  the  first  two  coordinates  were  zero,  combined  with  information  on  how 
surface  integrals  can  be  projected  onto  the  coordinate  planes  greatly  simplified  the 
calculations  required. 


Example  13.3  We  evaluate  JJ  curl(F)  where 

S 


F(v,  y,  z)  =  (y2cosvz,  x3e-vz,  —  e  xyz) 

and  S  is  the  portion  of  the  sphere  x2  -\-  y2  -\-  (z  —  2) 2  =  8  which  lies  above  the 
vy-plane  oriented  outwards.  The  edge  or  boundary  of  S,  T,  is  where  the  sphere  cuts 
the  vy-plane,  i.e.  where  z  =  0.  We  have  v2  +  y2  +4  =  8,  i.e.  x2  +  y2  =4  (Fig.  13.2). 

Since  the  positive  side  of  the  sphere  is  the  outside  we  see  from  Fig.  13.2  that 
the  surface  S  is  on  the  left  as  we  move  along  T  in  an  anticlockwise  direction  in  the 
vy-plane.  This  gives  us  our  direction  along  T.  By  Stokes’  theorem 


JJ  curl(F), 

s 


But  T  with  anticlockwise  direction  is  also  the  boundary  or  edge  of  the  surface 

Si  =  {(x,  y,z)  :  X2  +y2  <  4,  z  =  0} 
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Fig.  13.2 


oriented  by  the  normal  (0,  0,  1).  Hence  a  further  application  of  Stokes’  theorem 
implies 


JJ  curl(F), 


Now 


l 


J 


k 


curl(F)  = 


d  d 

dx  dy 

y2cosxz  x3eyz 


d 

dz 

Q-xyz 


Since,  however,  Si  projects  onto  smooth  curves,  which  have  zero  surface  area  in 
R2y  ^  and  R2x  zy  it  suffices  to  consider  the  final  coordinate  of  curl  (F).  This  is 


d 

dx 


d 


(v3evz)  —  —  (y2cosvz)  =  3x2eyz  —  2ycosxz 


and,  since  z  =  0  on  Si,  we  need  only  evaluate 


2  y)dxdy. 


x2+y2<  4 


By  symmetry 


(—2  y)dxdy  —  0. 


x2+y2<  4 
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Fig.  13.3 


If  we  use  the  parametrization 

(r,  6) - >  (r  cos  0,  r  sin  0,  0),  0  <  r  <  2,  0  <  0  <  2tx 


then 


3 r2  cos2  OrdrdO  = 


‘2jt 


COS 


2  OdO  /  3 r2  dr  =  1271. 


Moral  A  closed  curve  may  be  the  edge  or  boundary  of  more  than  one  surface  and 
a  suitable  choice  (of  surface)  may  simplify  calculations.  Projections  and  symmetry 
are  helpful. 

Example  13.4  We  wish  to  use  Stokes’  theorem  to  find  a  suitable  orientation  of  the 
curve  of  intersection  of  x2  +  y2  +  z2  =  a2  and  v  +  y  +  z  =  0,  T,so  that 

J  ydx  +  zdy  +  xdz  =  VStto2. 

The  curve  T  is  the  intersection  of  a  sphere  and  a  plane  through  the  centre  of  the 
sphere  and  hence  is  a  “great  circle”  or  “equator”  on  a  sphere  (see  Fig.  13.3). 

The  curve  T  is  the  edge  or  boundary  of  the  two  hemispheres  on  either  side  of  it 
and  also  of  a  portion  of  the  plane  v  +  y  +  z  =  0.  Which  we  use  will  depend  on  the 
function  being  integrated.  Let  F(v,  y,  z)  =  (y,  z,  x).  We  have 

i  j  k 

9  9  9 

dx  dy  dz 

y  z  x 


curl(F)  = 


=  (-1, -1, -1). 
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The  plane  v  +  y  +  z  =  0  has  unit  normals  ±(1,  1,  1)/V3  =  n\(x,  y,  z)  while  the 
sphere  has  unit  normals  ±(jc,  y,  z)/a  =  n2(v,  y,  z).  By  Stokes’  theorem 


ydx  +  zdy  +  xdz  = 


(5,  n) 


(-1,-1,-!) 


where  S  and  n  have  to  be  chosen  and  T  directed.  If  we  take  S  as  part  of  the  plane 
with  normal  (1,1,  1)/V3  then 

s  s  s 

=  —  VSna2 


since  S  is  a  disc  of  radius  a. 

Since  we  obtained  a  negative  answer  we  have  been  using  the  incorrect  orientation 
on  S  and  so  take  (-1/vT  -l/vT  -l/vT)  as  the  normal  which  describes  the 
orientation.  Hence  T  is  oriented  as  in  Fig.  13.3,  i.e.  it  appears  clockwise  when  looked 
at  from,  say,  the  point  (1 ,  1 ,  1)  or  from  any  point  sufficiently  far  out  in  the  first  octant. 

Moral  The  curl  of  a  vector  field  is  a  form  of  derivative.  If  the  entries  are  linear,  as 
in  this  example,  the  curl  is  constant.  If  the  entries  are  of  degree  2  then  the  curl  has 
linear  entries. 

Example  13.5  In  this  example  we  verify  Stokes’  theorem  for  the  portion  S  of  the 
surface  z  =  tan-1(y/v)  which  lies  inside  the  cone  x2  +  y2  =  z2  and  between  the 
planes  z  =  0  and  z  =  2tt  by  using  the  vector  field 

F(x,  y,  z )  =  xzi  +  yzj  -  (x2  +  j2)k. 

We  first  examine  the  surface  z  =  tan-1(y/v).  This  can  be  parametrized  as  a  graph 
using  Cartesian  coordinates  (see  Example  10.3)  but  it  is  preferable  to  use  polar 
coordinates. 

Let  x  =  r  cos  0,  y  =  r  sin  0  then 

tan-'  m  =»„"'« an0)  =  f> 

\xl  \rcos01 

and  we  obtain  the  parametrization 

(r,  6) - >  (r  cos  6,  r  sin^,  6)  (13.2) 

where  r  >  0  and  0  <  0  <  2tt  . 

What  sort  of  surface  is  this?  Well,  if  we  fix  different  values  of  r  and  let  6  vary 
we  obtain  a  helix  (see  Example  5.2).  In  Fig.  13.4  we  have  sketched  the  surface  for 
1/2  <  r  <  1.  We  are  considering  the  portion  of  the  surface  in  Fig.  13.4,  extended 
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Fig.  13.4 


Fig.  13.5 


over  all  r,  which  lies  inside  the  cone  and  between  the  two  planes.  It  is  thus  a  screw  - 
shaped  surface  or  spiral  staircase  with  the  spirals  or  steps  getting  wider  as  we  rise. 
The  fact  that  the  surface  lies  between  the  planes  z  =  0  and  z  =  tx  means  that  we 
have  just  one  full  twist  of  the  screw  or  one  turn  of  the  staircase  (Fig.  13.5). 

We  can  use  the  parametrization  (13.2)  but  the  restriction  caused  by  lying  inside 
the  other  surfaces  means  that  we  must  restrict  the  range.  Translating  the  boundaries 
into  polar  coordinates,  we  get 

2,2  2  2  /I  i  2*2/i  2  2 

x  +  y  =  r  cos  6  +  r  sin  6  =  r  =  z 

and  since  0  <  z  <  2tt  this  implies  r  =  z,  0  <  r  <  2tx  .  Hence  our  parametrization 
of  the  surface  is 


/ :  (r,  0) 


>  (r  cos0,  r  sin0,0),  0<r<6,0<6<  In. 
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We  have  F(/(r,  0))  =  ( rO  cos  0,  rO  sin0,  — r2)  and 


curl(F)  = 


i  j  k 

9  9  9 

dx  dy  dz 

xz  yz  -(x2  +  y2) 


=  — 3yi  +  3vj  =  (—3 r  sin0,  3 r  cos  6,  0), 


We  define  our  orientation  on  S  by 


frXfe  = 


i  j  k 

cos  0  sin  0  0 

-rsinO  rcosO  1 


=  (sin0,  —  cos0,  r), 


Hence 


II 

s 


‘2n  rO 


curl(F)  = 


(curl(F),  fr  x  fe)drd0 


'0  JO 
m  (r) 

*27 r  ,  rO 


—3rdr\d0  = 

'0  VO  '  JO 

3  f2n  ,  3  03  2jt 

2  J0  2  3  o 

^  =  -4,3. 


‘2n 


3r 


e 


o 


dO 


The  parametrization  of  S  is  over  the  set  U  in  M2  given  in  Fig.  13.6  and  the  boundary 
or  edge  of  the  surface  can  be  found  by  examining 


f(r,  0)  =  (r  cos  0,  r  sin0,  0) 


(13.3) 
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Fig.  13.7 

on  the  boundary  of  U.  We  can  also  look  at  it  geometrically  by  examining  where  the 
boundary  curves  intersect  S.  We  first  look  at  the  curve  of  intersection  of  the  cone 
and  the  screw.  The  cone  is  parametrized  (Examples  10.4  and  10.5)  by 

(r,  0) - >  (r  cos  6,  r  sin  0,  0) 

and  comparing  this  with  (13.3)  we  get  a  curve  of  intersection  T i  when  r  =  0.  This 
curve  is  parametrized  by 

0 - >  (6  cos  0,  0  sin  0,  0),  0  <  0  <  2jt. 

The  curve  T i  joins  the  origin  to  P\  =  (2n,  0,2tt).  The  second  curve  T2  is  obtained 
by  putting  0  =  2n  and  from  (13.3)  this  is  parametrized  by 

r - >  (r  cos  2tx ,  r  sin  2tv,  2tx)  =  (r,  0,  2n) 

where  0  <  r  <  2n .  This  is  the  straight  line  joining  P\  to  (0,  0,  2tv).  From  Fig.  13.6 
it  runs  in  the  negative  direction  and  so  we  must  reverse  the  orientation. 

The  third  curve  is  obtained  by  letting  r  =  0  in  (13.3)  and  we  have  a  parame- 
trization 

0 - >  (0,0,0),  0  <  0  <  2u. 

T  3  joins  (0,  0,  2tx  )  to  the  origin  and  T  =  TiUr2Ur3isa  closed  piecewise  smooth 
directed  curve  (Fig.  13.7). 

We  have  oriented  the  surface  and  directed  its  boundary.  Are  these  consistent  in 
order  to  apply  Stokes’  theorem?  Yes,  because  the  parametrization  /  of  T  is  obtained 
from  a  continuous  extension  of  a  consistent  parametrization  of  the  surface.  You  have 
already  seen  two  other  ways  in  which  it  is  possible  to  check  this  consistency.  We 
now  have  to  evaluate 
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We  have 


[  F=  [  ( — (0  cosO,  0  sin0,  0))  •  (62  cosO,  02  sin0,  —  02)  dO 

rn  Jo  ydd  / 


*2  JT 


(i 9 2  cos2  0  —  02  sin  6  cos  0  +  02  sin2  6  +  02  sin  0  cos  6  —  02)  dO 


‘2n 


0  do  =  0 


F  = 


r2 


/*h  /  ^  N. 

/  ( — (r,  0,  2n) )  •  (27rr,  0,  —  r2)dr 
Jlrt  'dr  / 


*0 


271  r 


2tx  r  dr  = 

’2n  2 

*0 


0 


2n 


~  —7t(27t)2  ~  — 47T3 


r3 

Hence 


F  =  /  (A(0, 0,  0))  •  (0,  0, =  0. 


F  =  -4: r3  =  JJ  curl(F) 

s 


and  we  have  verified  Stokes’  theorem. 

Moral  It  is  possible  to  verify  Stokes’  theorem. 

Exercises 

13.1  Let  S  denote  the  portion  of  the  sphere  x2  +  y2  +  z2  =  4 a2  in  the  first  octant 
which  lies  inside  the  cylinder  x2  +  y2  =  2 ax  oriented  outwards.  Let  T  denote 
the  boundary  or  edge  of  S  directed  in  accordance  with  Stokes’  theorem.  Sketch 
S  and  T.  Using  Stokes’  theorem  evaluate 

(a)  J  zdx  —  xdz , 

(b)  J  xdy  —  ydx , 

(c)  J  ydz  -  zdy. 

13.2  Sketch  the  surfaces  az  =  xy  and  x2  +  y2  =  b 2  in  M3.  Show  that 


0 


b2  sin  20 

->  ( b  cos  0 ,  b  sin  0 , - ) ,  0  <  0  <  2n 


2  a 


is  a  parametrization  of  the  intersection  V  of  the  two  surfaces  oriented  clockwise 
when  viewed  from  a  high  point  on  the  z-axis.  Show  that  the  surface  parame¬ 
trized  by 
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r2  sin  20 

P  :  (r,  0) - >  (r  cos  0,  r  sin0, - ),  0  <  r  <  b,  0  <  0  <  2n 

2  a 


has  T  as  its  edge  or  boundary.  Using  Stokes’  theorem  find 


ydx  +  zdy  +  xdz . 


13.3  Let  T  denote  the  curve  of  intersection  of  v  +  y  =  2b  and  x2  +  y2  +  z2  = 
2b (x  +  y)  oriented  in  a  clockwise  sense  when  viewed  from  the  origin.  Sketch 
the  appropriate  diagram.  Using  Stokes’  theorem  evaluate 


ydx  +  zdy  +  xdz . 


r 


13.4  Let  0  <  a  <  b  and  let  S  denote  the  torus  obtained  by  rotating  the  circle 
(x  —  b)2  +  z2  =  a2  in  the  xz- plane  about  the  z-axis.  Sketch  S.  Let  V  denote 
the  directed  curve  parametrized  by 

t - >  (( b  +  a  cos  nt )  cos  t,  (b  +  a  cos  nt )  sin  t,  a  sin  nt ) . 


Show  that  T  is  a  closed  curve  in  S.  Describe  and  sketch  S.  Let  S  denote  the 
surface  parametrized  and  oriented  by 


P  :  (r,  t) 


(r(b  +  a  cos  nt)  cos  t,  r(b  +  a  cos  nt)  sinL  a  sin  nt) 


where  0  <  r  <  1,  0  <  t  <  2tx  .  Show  that  T  is  the  boundary  of  S.  Let 
F(x,y,z)  =  (— y,  x,  0).  By  computing  both  fr  F  and  fs  curl(F)  verify  Stokes’ 
theorem.  Show  that  the  area  of  the  projection  of  S  onto  R2x  y)  is  it  ( a 2  +  2b2) /2. 

13.5  Let  S  denote  the  unit  sphere  oriented  outwards.  For  0  <  b  <  c  <  1  let 
SbiC  denote  the  part  of  the  sphere  between  the  planes  z  =  b  and  z  =  c.  Let 
F(X)  =  X||X||-3  for  all  X  ^  0  in  M3.  By  using  the  result  in  Exercise  6.3  and 
Stokes  Theorem  find  ff  F. 

&b,c 

13.6  Let  T  denote  the  curve  of  intersection  of  the  cylinder  x2  +  y2  =  a 2  and  the 
plane  x/a  +  z/b  =  l,  a  >  0,  b  >  0.  Use  Stokes’  theorem  to  find  a  direction 
along  T  so  that  fr  (y  —  z,  z  —  x ,  x  —  y)  is  positive.  Find  the  value  of  this  positive 
number. 

13.7  Use  Stokes’  theorem  to  find  a  suitable  orientation  of  the  curve  of  intersection  F 
of  the  hemisphere  x2  +  y2  +  z2  —  2 ax,  z  >  0,  and  the  cylinder  x2  +  y2  =  2 bx, 
0  <  b  <  a,  so  that 

/  ( y 2  +  z2)dx  +  (x2  +  z2)dy  +  (x2  +  y2)dz  =  2nab2. 


Chapter  14 

Triple  Integrals 


Summary  We  define  triple  integrals  of  scalar -valued  functions  over  open  subsets  of 
M3,  discuss  coordinate  systems  in  M3 ,  justify  a  change  of  variable  formula  and  use 
Fubinis  theorem  to  evaluate  integrals. 

Let  /  be  a  real- valued  function  defined  on  an  open  subset  U  of  M3 .  By  using  partitions 
of  the  coordinate  axes  to  draw  planes  parallel  to  the  coordinate  planes  (Fig.  14.1)  we 
obtain  a  grid  which  partitions  M3  into  cubes.  Let  (v; ,  y j ,  Zk)  denote  a  typical  point 
in  the  cube  [xi ,  1]  x  [yj ,  y/+i]  x  [zk,  Zk+ iL  The  Riemann  sum  of  /  with  respect 

to  this  grid 


'YF2FF_lf{xi,yj,Zk){xi+\  -Xi)(yj+ 1  -yj){zk+ 1  ~  Zk) 

i  j  k 


is  formed  by  summing  over  all  cubes  that  lie  in  U.  If  the  Riemann  sums  converge, 
as  we  take  finer  and  finer  partitions  and  grids,  to  a  limit  then  /  is  said  to  be  Riemann 
integrable  and  the  limit 


fix,  y,  z)  dxdydz 


u 


is  called  the  (Riemann)  integral  of  /  over  U . 

If  U  is  a  bounded  open  subset  of  M3  with  smooth  boundary  and  /  is  the  restriction 
to  U  of  a  continuous  function  /  on  U  then  /  is  integrable  over  U.  This  result,  proved 
using  uniform  continuity  of  /  on  the  compact  subset  U  of  M3  ,  implies  the  existence 
of  an  abundance  of  integrable  functions. 

If  fix,  y,  z)  =  1  the  Riemann  sum  is  the  volume  of  all  cubes  inside  U  and  in  the 
limit  equals  the  volume  of  U,  Vol (U).  Thus 

Vol(L)  =  JJJ  dxdydz . 

u 
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14  Triple  Integrals 


Fig.  14.1 


Fig.  14.2 


In  evaluating  triple  integrals  we  use  an  extension  of  Fubini  ’s  theorem.  This  is  obtained 
from  the  Riemann  sum  by  first  summing  over  i ,  taking  a  limit,  then  summing  over  j 
and  taking  a  limit  and  finally  summing  over  k  and  taking  a  limit.  To  justify  this  process 
it  is  usual  to  assume  that  the  domain  of  integration  has  a  “box-like”  appearance,  i.e. 
it  is  bounded  above  and  below  by  surfaces  z  =  fi(x,y)  and  z  =  fi(x,y),  front  and 
rear  by  surfaces  x  =  gi  (y ,  z)  and  x  =  g2(y,z)  and  on  the  left  and  right  by  surfaces 
y  =  h\(x,  z)  and  y  =  h2(x,  z).  The  situations  we  discuss  are  of  this  type  but  by  no 
means  reflect  the  full  range  of  examples  to  which  Fubini’ s  theorem  applies.  Many 
open  sets  can  be  partitioned  into  a  finite  union  of  sets  and  Fubini ’s  theorem  applies 
to  each  of  these.  We  refer  to  our  remarks  on  Green’s  theorem  in  Chap.  9  for  further 
details. 

To  apply  Fubini ’s  theorem  we  must  examine  various  cross-sections  of  the  domain 
of  integration  U.  First  suppose  that  the  set  of  non-empty  cross-sections  of  U  parallel 
to  the  vy-plane,  i.e.  those  obtained  by  fixing  the  z-coordinate,  determine  an  interval 
(i a ,  b)  on  the  z-axis  (Fig.  14.2). 

Let  A(z)  denote  the  cross-section  defined  by  fixing  z  in  (a,  b),  i.e. 

A(z)  =  {(x,  y)  e  :  (x,  y,  z)  e  U). 
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Fig.  14.3 


This  means 


fix,  y,  z)  dxdydz  = 


fix,  y,  z)  dxdy 


dz 


u 


a  A(z) 


and  we  now  have  to  evaluate  the  inner  integral  of  two  variables.  In  some  cases  it 
is  possible  to  do  this  directly,  see  for  instance  Example  14.1,  but  usually  we  apply 
the  two- variable  Fubini’s  theorem  to  each  A(z).  For  this  we  assume  that  the  region 
Aiz)  C  y)  is  bounded  on  the  left  and  right  by  the  graphs  of  functions  of  y  over 
some  interval  (Fig.  14.3).  Both  the  functions  and  the  interval  will  depend  on  z  and 
we  denote  the  interval  by  (c(z),  diz))  and  the  functions  on  the  left  and  right  by 
y  — >  l(y,  z)  and  y  — >►  r(y,  z)  respectively. 

This  implies 


d(z)  r(y,z) 


fix ,  y,  z)  dxdy  = 


fix ,  y,  z)dx 


Mz) 


c(z )  l(y,z) 


dy 


and 


b  d(z)  r(y,z) 


fix,y,z)  dxdydz  = 


fix,  y,  z)dx 


u 


a  c(z )  Ky,z) 


dy 


dz. 


Thus  to  evaluate  triple  integrals  it  is  necessary  to  identify,  by  sketching,  cross-sections 
of  the  open  set  U.  Once  this  has  been  achieved  and  the  result  compared  with  the 
abstract  figures  above  it  is  a  matter  of  writing  down  the  iterated  integrals  and  evalu¬ 
ating  them  using  one- variable  integration  theory.  We  have,  as  in  the  two-dimensional 
case  a  choice  in  the  order  of  integration — in  fact  a  total  of  3!  =  6  choices,  some  may 
be  easy,  others  difficult  and  some  impossible.  There  are  no  definite  rules. 
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A  particularly  simple  situation  occurs  when  U  =  (a\,b\)  x  (<22,  £>2)  x  (<23,  £>3) 
and  f(x,y,z)  =  g(x)h(y)k(z)  where  g,  h  and  k  are  functions  of  a  single  variable. 
In  this  case 


b\  b2  b 3 

/<*•  (/*<H 


t/ 


«2 


«3 


For  example,  if  1/  =  (0 ,  a)  x  (0,  ft)  x  (0,  c)  then 


a 


b 


xy2z2  dxdydz  =  I  xdx  •  /  yzdy  •  /  = 


a2b3c 
24 


4 


£/ 


0  0 


0 


Example  14.1  Let  .S  denote  the  solid  ball  of  radius  r  with  centre  at  the  origin,  i.e. 
B  =  {(x,  y,  z)  :  x2  +  y2  +  z2  <  r2}.  We  calculate  the  volume  of  B  by  integrating 
the  function  /  =  1  over  B.  Figure  14.4a  shows  clearly  that  the  values  of  z  which 
give  non-zero  cross-sections  lie  in  the  interval  (— r,  r)  and  the  cross-section  for  fixed 
z  is  the  disc  x2  +  y2  <  r2  —  z2  (Fig.  14.4b). 

In  this  special  case  a  direct  computation  is  possible  since 


1  dxdy  =  Area  (A(z))  =  tt (rA 


Mz) 


and 


Vol(5)  = 


1  dxdy  \  dz 


! 


=  /  n(r2  —  z2)dz  =  7 r  (  rzz 


(' 


(14.1) 


(14.2) 


We  now  consider  the  more  typical  approach  to  evaluating  the  inner  integral  over  A(z) 
by  applying  Fubini’s  theorem  in  two  variables.  In  Fig.  14.4b  we  have  sketched  the 
cross-section  A(z)  in  the  vy-plane.  The  equation  x2  +  y2  =  r2  —  z2  (z  fixed)  has 
two  solutions 


2 


These  give  the  total  variation  of  x  and  the  boundary  functions,  /  and  r.  We  thus  have 
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(a)  (b) 


Fig.  14.4 


+Vr2-z2  +\/i 


r2_y2_z2 


Vol(fl)  = 


dx 


-Jr2-z2  —a/  r2—y2—z2 


dy 


dz 


■ +Vr2-z 2 


r 

2/' 

— r 


-Vr2-z2 

‘+7T/2 
—  7T/2 

.2  2 


2\/ r2  -y2  -  z 2  dy 


dz 


(r2  —  z2)  cos2  0 dO  1  dz 


IX  l  (r2  -  Z2)dz  =  7T  ^r2Z  -  y 


) 


47rr- 


where  we  let  y  =  (r2  —  z2)1//2  sin0,  dy  =  (r2  —  z2)1//2  cos^. 

The  geometry  of  the  previous  example  was  reasonably  straightforward.  Many 
examples  appear  initially  to  involve  a  rather  complicated  geometric  shape.  However, 
we  are  usually  dealing  with  a  limited  number  of  objects,  mainly  conic  sections, 
mixed  together  and  once  familiarity  with  these  has  been  established  and  sufficiently 
many  cross-sections  sketched — with  the  help  of  the  defining  inequalities — the  correct 
approach  often  presents  itself. 

Example  14.2  We  wish  to  find  the  volume  of  the  region  V  lying  below  the  plane 
z  =  3  —  2y  and  above  the  paraboloid  z  =  x2  +  y2,  i.e.  the  set  {(v,  y,  z)  :  x2  y2  < 
z  <  3  —  2y}.  We  begin  by  considering  a  full  sketch  (Fig.  14.5). 

The  coordinates  of  P  and  Q  are  found  by  solving  between  the  equations  z  = 
x2  +  y2  and  z  =  3  —  2 y.  These  imply  x2  +  y2  =  3  —  2 y,  i.e.  x2  +  y2  +  2y  +  1  =4. 
Hence  v2  +  (y  +  l)2  =  22.  The  extreme  values  of  y  are  obtained  by  letting  v  =  0. 
This  gives  (y  +  l)2  =  22,  i.e.  y  +  1  =  ±2.  Hence  y  =  —3  or  +1.  The  coordinates 
of  P  and  Q  are  (0,  —3,  9)  and  (0,  1,  1)  respectively.  Hence  —  3  <  y  <  +  1.  From 
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Fig.  14.5 


Fig.  14.6 


Fig.  14.5  we  see  0  <  z  <  9  and  from  the  equation  x2  +  (y  +  l)2  =  2  we  deduce  that 
— 2  <  x  <  2.  Having  found  the  extremal  values  for  non-empty  sections  we  sketch 
the  corresponding  cross-sections. 

First  fix  y ,  —  3  <  y  <  1 .  In  the  zv -plane,  z  =  y2-\-x2  is  a  parabola  and  z  =  3  — 2y 
is  a  straight  line.  From  Fig.  14.6  it  follows  that  for  fixed  y, 

—(3  -2y-  y2)x'2  <  x  <  (3  -  2 y-  y2)1'2 
and  for  fixed  x  and  y 

x2  y2  <  z  <  3  -  2y. 


We  have 


1  (3—2  y-y2)l/2  3-2  y 


-3  — (3— 2y— j2)1/2  x2+y2 

1  (3—2  y-y2)1/1 


-3  —  (3— 2y— j2)!/2 


-  - 

3-2  y 

z 

dx 

—  — 

x2+y2 

dy 


Vol(V)  = 
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1  (3-2y-y2)1/2 


(3  —  2  y  —  y2  —  x 2)  dx 


—3  —  (3 — 2jy — y2)  I/2 

1 


dy 


(3-2 y-  y2)x  - 


3  -i(3-2 y-y2)l/1 


^  -  —  (3— 2y— j2)1/2 

1 


dy 


=  j\o-2y-  S-?r-dy  =  4-j(t-(y+  1,2)  3,2  rf. 


Let  y  +  1  =  2  sin  0,  then  dy  =  2  cos  OdO,  4  —  (y  +  l)2  =  4  —  4  sin2  0  =  4  cos2 
so 


n/2 


Vol(V)  =  -  /  (4  cos2  0)3/2  •  2  cos 


—n/2 

n/2 


64 


cos4 


-n/2 

n/2 


64  f  1  /  1  \ 

—  /  -  (l  +  2  cos  20  +  -(1  +  cos  4^)  j 

—n/2 


16  37T 


3  2 


=  871. 


We  choose  to  take  y  as  the  final  variable  in  our  order  of  integration  since  it  is  clear 
from  Fig.  14.5  that  all  cross-sections  parallel  to  the  vz-plane  are  of  the  same  type 
whereas  cross-sections  parallel  to  the  vy-plane,  i.e.  fixing  z,  are  different  for  z  <  1 
and  z  >  1 .  A  different  order  of  integration  can  be  used  as  a  second  opinion. 

From  the  two  previous  examples  we  see  that  the  evaluation  of  triple  integrals 
proceeded  in  three  stages.  First  we  chose  an  order  of  integration  and  next  examined  the 
geometry  of  the  domain  of  integration  in  order  to  determine  the  limits  of  integration 
in  the  inner  integrals.  Finally  we  evaluated  a  sequence  of  one  variable  integrals.  The 
alternatives  at  each  stage  in  the  process  point  towards  a  useful  technique  commonly 
called  change  of  variable.  In  place  of  a  detailed  motivation  we  briefly  mention  three 
pertinent  ideas. 

(a)  The  domain  of  integration  U  was  partitioned  into  cubes,  Fig.  14.7a,  with  element 
of  volume  A  V  easily  calculated  using  Ax  x  x  A z.  We  could  use  instead  a 
grid  based  on  spheres  centred  at  the  origin  and  planes  through  the  origin  to  obtain 
a  different  element  of  volume,  Fig.  14.7b,  or  a  grid  based  on  cylinders  parallel 
to  the  z-axis  and  planes  perpendicular  and  parallel  to  the  z-axis  (Fig.  14.7c). 
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Fig.  14.7 


These  alternatives  lead  to  more  complicated  formulae  for  AV  but  hopefully 
the  new  limits  of  integration  and  the  resulting  one-variable  integrals  are  less 
complicated. 

(b)  We  used  Cartesian  coordinates  (x,  y,  z)  to  denote  a  typical  point  in  the  domain 
of  integration  U  but  we  may  consider  other  methods  of  identifying  points  in  U. 
For  instance,  if  U  is  the  solid  sphere  of  radius  1  then  each  point  in  U  lies  in  a 
sphere  of  radius  r,  0  <  r  <  1,  and  using  the  parametrization  of  the  sphere  of 
radius  r  given  in  Example  10.5  we  can  identify  points  of  U  by  means  of  r  (the 
distance  to  the  origin),  0  the  angle  of  latitude  and  x/r  (the  angle  of  longitude). 
In  terms  of  the  coordinates  (r,  6,  x/r)  the  domain  U  becomes  the  parallelepiped 
(0,1)  x  (— 7t/2,  7r/2)  x  (0,  2tt)  and,  as  previously  noted,  integration  in  this  case 
is  much  more  pleasant.  The  new  set  of  coordinates  gives  a  correspondence  F 
between  a  domain  in  Rj?r  e  ^  and  the  original  domain  U  in  RjC,z)  (Fi§- 14-8>- 
By  Example  10.5,  F(r,6,  xfr)  =  (r  cos  0  cos  x\r,  r  cos  0  sin  x/r,  r  sin  0).  The  idea 
now  is  to  transfer  the  cubical  grid  on  U,  using  F,  to  a  grid,  which  is  usually 
not  cubical,  on  F(U)  and  hence  to  evaluate  the  integral.  In  carrying  out  this 
operation  it  will  be  necessary  to  calculate 

Vol  (F(A  V))  =  Vol  (F(Ar  x  A  Ox  Ax//)) 


Fig.  14.8 
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and  this  is  the  problem  that  also  arises  in  (a).  The  mapping  F  has  many  of  the 
features  that  we  have  previously  associated  with  a  parametrization  and,  by  now, 
the  following  definition  should  appear  natural. 

Definition  14.3  A  parametrization  of  an  open  set  V  in  M77  is  a  bijective  differentiable 
mapping  F  from  an  open  subset  U  of  M77  onto  V  such  that  F'(X )  is  an  invertible 
linear  operator  for  all  X  in  U. 

This  definition  contains  the  essential  properties  that  we  used  to  parametrize  curves 
and  surfaces  and  may  also  be  regarded  as  a  method  of  providing  V  with  a  new 
coordinate  system.  The  requirement  that  F'(X )  be  invertible  (or  equivalently  that 
det(F'(X))  7^  0)  is  the  three-dimensional  analogue  of  the  condition  P'(t)  ^  0  for 
curves  and  <px  x  0y  ^  0  for  surfaces. 

(c)  A  third,  perhaps  more  obscure,  approach  is  motivated  by  the  substitutions  that 
may  arise  in  the  one- variable  integrals  in  the  final  stage.  In  Example  14.2  we 
needed  one  such  change.  Working  backwards  it  may  be  possible  to  choose  ini¬ 
tially  a  coordinate  system  which  does  not  require  a  change  of  variable  in  the 
iterated  integrals. 

We  return  now  to  (b)  above  to  work  out  the  formula  for  the  change  of  variable. 
Let  U  denote  an  open  subset  of  M3r  s,  t)  and  let  g:  U  — >  g(U)  =  V  denote  a 

parametrization  of  the  open  subset  V  of  M3m  v  wy  Note  that  g_1  is  a  parametrization 

of  U.  To  avoid  confusion  we  think  of  U  and  V  as  lying  in  different  copies  of  M3  each 
with  their  own  set  of  coordinates,  (r,  s,  t)  and  (u,  v,  w)  respectively.  This  explains 
the  terminology  “change  of  variables”. 

Let  /  denote  an  integrable  function  on  g(U)  (Lig.  14.9).  Take  a  cubical  grid  on 
U,  transfer  it  by  g  to  a  grid  on  g(U)  and  then  form  a  Riemann  sum  of  /.  A  typical 
term  in  this  Riemann  sum  is 

/  (, g(n,Sj,tk ))  Vol  (g(Ar,-  x  A sj  x  Atk)) . 


Fig.  14.9 
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(a) 


(b) 


gtAt 


grAr 


Fig.  14.10 


Now  g  ( Art  x  Asj  x  Atk)  is  approximately  a  parallelepiped  with  sides  gr  A r,  gs  As 
and  At.  We  have  already  discussed  the  volumes  of  parallelepipeds  while  intro¬ 
ducing  Stokes’  theorem  in  Chap.  12.  The  area  of  the  base  of  the  parallelepiped  is 
|| gr  x  gs  Art  •  Asj  ||  (Fig.  14.10a)  where  the  partial  derivatives  of  g  are  evaluated  at 
(ji ,  s  j ,  tk ).  The  height  of  the  parallelepiped  is  the  length  of  the  projection  of  gt  Atk 
onto  a  direction  perpendicular  to  the  base.  Since  gr  x  gs  is  perpendicular  to  the  base 
the  required  height  is  (gr  X  fo  •  gt  /  lift  X  gs||)  Atk  =  lift  II  cos  6  Atk  (Fig-  14.10b). 

Hence 


Vol  (g(Ari  x  As  j  x  A  tk))  ||fo  x  fo  •  ft||  ArtAsj  Atk. 


By  Exercise  6.5(a) 


llftXfo-ft||=  det(g') 


and 


Vol  (g(Ar;  x  A Sj  x  Atk))  ^  |det(g')|  ArjAsjAtk . 


Hence  the  Riemann  sum  of  /  over  g(U)  is  approximately 


where  we  sum  over  the  cubes  in  the  partition  of  U.  In  the  limit  we  get  the  change  of 
variables  formula 


/// 


f(u,  v,  w)dudvdw  = 


/// 


f(g(r,s,t))  det  [g'(r,  s,  t))  drdsdt. 


g(U) 


u 
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d(u,v,  w) 

The  notations  Jig)  and - are  also  used  in  place  of  det(g  )  and  this  deter- 

d(r,s,t) 

minant  is  called  the  Jacobian  of  g. 

If  V  is  an  open  subset  of  M2  then  we  may  identify  V  with  the  open  subset  V  = 
V  x  (0,  1)  in  M3.  A  function  / :  V  — >  R  is  integrable  over  V  if  and  only  if  /,  defined 
by  fix,  y,  z)  =  fix,  y),  is  integrable  over  V  and,  moreover, 


f{x,  y,  z)dxdydz  =  fix ,  y)dxdy 


v 


v 


If  g:  U  C  M2,  v)  — >  V  C  M2w  v)  is  a  mapping  between  the  open  sets  U  and  V  then 
it  is  easily  seen  that  g  is  a  parametrization  of  V  if  and  only  if  g  :  U  C  M3r  v  — > 

i3 

\u,v.w)' 


V  C  M3 . a,  defined  by  g(r,  s,  t)  =  (g(r,  s),t),  is  a  parametrization  of  V.  Since 


8  = 


g 


0 


we  have  det(gr)  =  det(gO  and 


V  V  u 

f  og|det(g')|. 


u 


This  justifies  the  change  of  variables  formula  for  double  integrals  (Chap.  9)  and 
yields  the  following  familiar  formula 


/ (w,  v)dudv  =  I  f(g(r,s)) 

v=g(U)  U 


Uy  n  $ 

vr  vs 


drds 


where  g(r,  s)  =  (u(r,  s),  v(r,  s)).  In  particular,  for  polar  coordinates  in  the  plane, 
g :  (r,  6)  — >  (x,  y)  =  (r  cos  0,  r  sin  0),  we  have 


det(s')l  = 


cos  0  sin  6 
-r  sin  0  r  cos  0 


=  r 


and 


f(x, y)dxdy  = 


x2+y2<  1 


0<r<l 

O<0<27r 


f{r  cos  0,  r  sin  0)rdrd6, 
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Fig.  14.11 


If  S  is  a  simple  oriented  surface  in  M3  and  (gi,  U\)  and  ( g2 ,  U2)  are  parametriza- 
tions  of  S  consistent  with  the  orientation  then  the  bijectivity  and  smoothness  of  gi 
and  g2  imply  that  the  mapping  gf{  o  gi  is  a  parametrization  of  U2  (Fig.  14.11). 

The  change  of  variables  formula  for  double  integrals  may  now  be  used  to  show 
that  integrals  over  S  are  independent  of  the  parametrization  and  justifies  the  notation 
used  in  earlier  chapters  (we  remark  that  gi  and  g2  map  onto  the  same  side  of  S  if  and 

only  if  det  ^(g^1  o  gi)^  is  always  strictly  positive). 

We  have  noted  above  how  a  solid  sphere  is  a  union  of  surfaces,  i.e. 

{(*,  y,  z)  :  X2  +  y2  +  Z2  <  r2}  =  [J  {(*,  y,  z)  :  x2  +  y2  +  z2  =  s2} 

0<s<r 


and  using  parametrization  of  surfaces  we  were  able  to  fill  in  to  obtain  a  parame¬ 
trization  of  the  solid  sphere.  In  fact,  we  do  not  quite  get  a  parametrization  of  the  full 
solid,  since  to  obtain  a  bijective  mapping,  we  are  forced  to  miss  a  small  portion  of  the 
sphere.  This  is  the  same  problem  that  we  encountered  and  discussed  fully  in  parame¬ 
trizing  the  classical  surfaces  (Chap.  10).  We  do  not  enter  into  a  full  discussion  here 
but  remark  that  after  a  similar  discussion  we  would  arrive  at  an  analogous  conclusion 
for  solids.  The  portion  of  the  solid  omitted  has  volume  zero  and  so  for  integration 
purposes  we  may  treat  the  mappings  in  the  following  example  as  parametrizations 
of  the  full  solid. 

Example  14.4  The  procedure  outlined  above  for  the  sphere  can  be  applied  to  a 
number  of  solids  and  using  Table  11.1  we  obtain  the  following  parametrizations. 

(a)  Solid  sphere  of  radius  a  (spherical  polar  coordinates) 

(0,  a)  x  (0, 7 r)  x  (0,  2n)  e  M3 - >  j(v,  y,  z)  :  x1  +  y2  +  z2  <  a2 J 

(r,  0 ,  x//) - >  (r  sin  0  cos  x/r,  r  sin  6  sin  if,  r  cos  0) 
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(b)  Solid  ellipsoid  (elliptical  polar  coordinates) 


(0,  1)  x  (0,  jt )  x  (0,  2tt)  e  M: 


x2  y2  z 2 

(x,  y,  z)  :  —  +  +  —  <  1 


a * 


b2 


cA 


(r,  0,  xfr) 


■>  ( ra  sin  6  cos  xj/,  rb  sin  6  sin  xj/,  rc  cos  0) 


(c)  Solid  of  revolution  generated  by  revolving  the  area  beneath  the  plane  curve 
Pit)  =  (x(t),  y(t )),  t  e  [a,  b]  and  y(t)  >  0,  about  the  v-axis  (Example  10.4), 

(0,  1)  x  ( a ,  b)  x  (0,  2tt)  e  M3 - >  Solid  of  Revolution 

(r,  t ,  0) - >  ( x(t ),  ry(t)  cos  0,  ry(t)  sin  0) 

(d)  The  solid  cylinder  of  radius  r  and  height  h  parallel  to  the  z-axis  is  defined  by 
{(v,  y,  z)  :  v2  +  y2  <  r2,  0  <  z  <  h }  and  parametrized  by 


(s,  0 ,  z) 


■>  (s  cos  6,  s  sin^,  z) 


with  domain  (0,  r)  x  (0,  2tt)  x  (0,  h)  (cylindrical  coordinates) 

(e)  The  inverted  solid  cone  {(v,  y,  z)  :  x2  +  y2  <  z2,  0  <  z  <  1}  is  parametrized 
by 

(r,  0,  z) - >  (r  cos  0,  r  sin^,  z) 


where  0<r  <  z,  0  <  0  <  2tt,0  <  z  <  1. 

(f)  The  inverted  solid  paraboloid  defined  by  {(v,  y,  z)  :  x2  +  y2  <  z,  0  <  z  <  1} 
is  parametrized  by 


(r,  0,  z) - >  (r  cos  6,  r  sin0,  z) 

where  0  <  r  <  <Jz,  0  <  0  <  2tv,0  <  z  <  1 . 

Example  14.5  In  this  example  we  calculate  the  volume  of  a  solid  torus.  We  discussed 
the  boundary  surface  of  a  torus  in  Example  11.1  and  obtained  the  parametrization 

f(Q,  V0  =  ((/?  +  r  cos  0)  cos  \j/,  (b  +  r  cos  0)  sin  if,  r  sin  0) 

where  0<0<27t,0<\l/<27t  (Fig.  1 1.4).  We  generate  the  solid  torus  by  rotating 
the  disc  and  obtain  the  parametrization 

F :  (0,  2tt)  x  (0,  2tv)  x  (0,  r) - >  Solid  Torus 

(0,  if,  s) - >  ((b  +  51  cos  6)  cos  if/,  ib  T  s  cos  6)  sin  x/s,  s  sin  0) . 

We  will  use  the  change  of  variable  formula  to  calculate  the  volume  but  first  note  how 
the  following  geometric  observation  immediately  gives  the  answer.  The  solid  torus 
is  obtained  by  rotating  a  disc  of  radius  r  with  centre  on  the  y-axis  at  a  distance  b 
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from  the  origin  about  the  z-axis.  Thus  the  disc  is  rotated  through  a  distance  2nb  and 
generates  a  solid  whose  volume  is 

nr2  •  2i rb  =  2tx 2r2b. 

By  the  change  of  variable  formula 


Volume  (Torus)  = 


det  ( F'(0 ,  \ //,  s))  |  dOdi/zds. 


[0,27t]  x  [0,27r]  x  [0,r] 


From  Table  1 1.1  we  obtain,  on  replacing  r  by  s,  the  first  two  columns  of  F!\Fq  =  fo 
and  F^  =  f^.  Hence 


(—  ssin^cost/f  —  (b  +  s  cos0)  sin  xj/ 

—ssinOsinx//  (b  +  s  cos0)  cos  x/r 
scosO  0 

From  the  matrix  representation  it  is  easily  seen  that  F$,  F ^  and  Fs,  i.e.  the  columns 
of  F ' ,  are  mutually  perpendicular  vectors  and  hence  generate  a  parallelepiped  shaped 
like  a  rectangular  box.  In  this  case  the  volume  is  the  product  of  the  lengths  of  the  sides. 
A  further  application  of  Table  11.1  gives  us  ||  Fq  ||  =  ||  fe  ||  =  y/~E  =  s  and  ||  Ff  ||  = 
\\fy\\  =  VG  =  b  +  scosO.  Since  ||F5||  =  (cos2  0  cos2  \fr  +  cos2  0  sin2  xfr  + 
sin2  0)1//2  =  1  we  have 


cos  0  cos  x/r 
cos  6  sin  x/r 
sin# 


and 


det  ( F'(0,x/r,s))\  =  \\Fe\\  •  \\F^\\  •  \\FS\\  =  s(b  +  s  cos 6), 


Volume  (Torus)  = 


s(b  +  s  cos  6)d0d\l/ds 


[0,2 tt]  [0,2 tt]  0 

2tt  r  2n 


=  /  dx/s 


(sb  +  s2  cos  0)d0 


ds 


0  0  0 
r 


=  2n  J  ( sbO  +  s2  sin0) 
0 


2jt 


0 


ds 


2 

Sz  r 


=  2tt  sb2nds  =  4n2b —  =  2jt2br2, 

2  o 


o 
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Fig.  14.12 


Fig.  14.13 


Example  14.6  To  find  the  volume  of  the  solid  contained  within  the  sphere  v2  +  y2  + 
z2  =  4 a2  and  the  cylinder  v2  +  (y  —  a)2  =  a2.  In  Example  12.3  we  considered  a 
geometric  situation  similar  to  the  present  one.  By  not  moving  the  origin  we  adopt 
here  a  slightly  different  approach.  From  Fig.  14.12  we  see  that  the  solid  lies  above 
and  below  the  plane  disc  x2  -\-  (y  —  a)2  <  a2  and  this  immediately  suggests  polar 
coordinates. 

The  volume  V  is  equal  to 


1 dxdydz . 

x2+y2+z2  <4a2 
x2+(y—a)2  <a2 

We  parametrize  the  solid  using  polar  coordinates  in  the  vy-plane  and  the  usual 
Cartesian  z  coordinate,  i.e.  we  use  the  cylindrical  coordinates 

F :  (r,  6 ,  z) - >  (r  cos0,  r  sin0,  z). 


Since 


(cos  9  —r  sin 9  0 

sin  9  r  cos  9  0 

0  0  1 


F'(r,  0 ,  z)  = 
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Fig.  14.14 


we  have  det  (F'(r,  6,  z))  =  r.  To  find  the  limits  of  integration  consider  Fig.  14.13  in 
the  vy -plane. 

We  see  that  0  <  0  <  it ,  0  <  r  <2*2  sin  0  and,  from  Fig.  14.12,  z 2  <  4a2  —  x2  — 
y2  =  4*22  —  r2,  i.e.  —\/\a2  —  r 2  <  z  <  V4 a2  —  r2.  Hence 


7 t  2a  sin 0  V4<22— r2 


Volume  = 


0  0 


rJz 


Jr 


\d0 


■  r2 


2<2  sin  0 


a/  4r/ 2  —  r2Jr 


J0, 


Let  s  =  4*2 2  —  r2.  Then  Js  =  —2 r  Jr  and 


71 

Volume  =  2  J 

o 


7T 


0 


4«2  cos2  0 


/ 

4<22 

2^3/2 


3 


(_Li/2)js 


4<22  cos2  0 
4  a1 


') 


dO 


dO 


cos3  0\)d0  = 


Example  14.7  In  this  example  we  calculate  the  volume  of  the  region  U  bounded 
above  by  the  paraboloid  z  =  9  —  x2  —  y2,  below  by  the  vy-plane  and  which  lies 
outside  the  cylinder  x2  +  y2  =  1  (Fig.  14.14). 

The  presence  of  v2 +y 2  in  the  defining  inequalities  suggests  cylindrical  coordinates 
(r,  6,  z)  (the  presence  of  x2  +  y2  +  z 2  would  suggest  geographical  or  spherical  polar 
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coordinates).  From  Fig.  14.14  we  see  that  U  projects  onto  {(x,  y)  :  1  <  x2-\-y2  <  9} 
in  the  xy-  plane.  Hence  1  <  r  <  3  and  z  varies  over  0<z<9  —  x2  —  y2  =  9  —  r2. 
From  the  previous  example  we  know  that  the  Jacobian  is  equal  to  r.  Hence  the 
required  volume  is 


2jt 


0 


3 


1 


r3)dr  \d0 


=  2n  • 


Exercises 


14.1  Show  that  the  volume  of  the  solid  inside  the  cylinder  x2  +  y2  —  lay  =  0  and 
between  the  plane  z  =  0  and  the  paraboloid  4 az  =  x2  +  y2  equals  3na3 /%. 

14.2  Find  the  volume  of  the  solid  inside  the  cylinder  x2  +  y2  =  lay  which  lies 
between  the  plane  z  =  0  and  the  cone  x2  +  y2  =  z2. 

14.3  Show  that  the  volume  of  the  solid  defined  by  the  inequalities  x2  +  y2<  1  and 
tan _1(y/x)  <  z  <2tt  equals  n2. 

14.4  Let  U  denote  the  region  above  the  plane  z  =  0  between  the  cone  z2  =  x2  +  y2 
and  the  paraboloid  z  =  2  —  x2  —  y2.  Show  that  this  region  projects  onto  the 
unit  disc  in  the  xy -plane.  Using  cylindrical  coordinates  or  otherwise  show  that 
the  volume  of  U  equals  5tt/6. 

14.5  Evaluate  the  following  integrals: 


-2 


3—2y  (z-y2)l/2 


dx 


y2  ~(z— y2)1/2 

—  \+{A—x2)l/2  3-2  y 


dz 


dy 


dz 


■1  — (4— x2)!/2  x2+y2 

rz  3— z 

\[Z  2 


dy 


dx 


dy 


~y/z  —(Z-X2)1/2 


dx 


dz . 


14.6  Use  the  change  of  variables  formula  to  calculate  the  volumes  of  the  solids 
parametrized  in  Example  14.4. 

14.7  Find  the  volume  of  the  wedge  of  the  cylinder  {(x,y,  z)  :  x2  +  y2  <  1}  which 
lies  above  the  xy -plane  and  between  the  planes  z  =  —  y  and  z  =  0. 
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14.8  Find  the  volume  of  the  open  set  which  lies  above  the  square  0  <  v  <  1, 
0  <  y  <  1  in  the  xy- plane  and  below  the  surface  z  =  (1  +  x  +  y)1/2.  Hence 
write  down  the  volume  of  the  set  enclosed  by  the  surfaces  z  =  (1  +  lvl  +  lyl)1/2, 
z  =  —  (1  +  \x  \  +  lyl)1/2  and  by  the  planes  x  =  l,x  =  —  1,  y  =  1  and  v  =  —  1 . 

14.9  Find  the  volume  (of  the  ice-cream  cone)  that  lies  above  the  vy -plane,  inside 
the  cone  3(x2  +  y2)  =  z 2  and  the  sphere  x2  +  y2  +  z2  =  4 a2. 

14.10  Evaluate 


III 


zdxdydz 


v 


where  V  =  {(v,  y,  z)  :  0  <  z  <  x^/y,  0  <  y  <  1,  l<v<2}. 

14. 1 1  Use  the  change  of  variables 

F  :  (u,  v,  w) - >  (u{  1  —  v),  uv(  1  —  w),  uvw ) 


to  calculate 

J j j  xdxdydz  and  jjj 
V  V 

where  V  is  the  tetrahedron  cut  from  the  first  octant  by  the  plane  x  +  y  +  z=l. 


Chapter  15 

The  Divergence  Theorem 


Summary  We  state,  discuss  and  give  examples  of  the  divergence  theorem  of  Gauss. 

The  divergence  theorem  of  Gauss  is  an  extension  to  M3  of  the  fundamental  theorem 
of  calculus  and  of  Green’s  theorem  and  is  a  close  relative,  but  not  a  direct  descendent, 
of  Stokes’  theorem.  This  theorem  allows  us  to  evaluate  the  integral  of  a  scalar- valued 
function  over  an  open  subset  of  M3  by  calculating  the  surface  integral  of  a  certain 
vector  field  over  its  boundary. 

In  Chap.  6  we  defined  the  divergence  of  the  vector  field  F  =  (/i ,  /2,  ff)  as 


div(F)  =  V  •  F  = 


( 


9  9  9 


dx  ’  9y  ’  dz 


) 


■  (A,  h,  h) 


9/l  9/2  9/3 

dx  dy  dz 


and  we  have  previously  written  symbolically 


curl(F)  =  V  x  F. 

Carrying  this  symbolism  a  step  further  we  now  write 


div(curl  F)  =  V  •  V  x  F  = 


9 


dx 

9 


dx 

h 


9 


dy 

9 


dy 

h 


9 

dz 

9 

dz 

h 


This  suggests,  since  the  determinant  of  a  matrix  with  two  identical  rows  is  zero,  that 
div(curl  F)  =  0  for  any  smooth  vector  field  F.  This  is  indeed  true.  Symbolism,  how¬ 
ever,  does  not  prove  anything  and  it  is  necessary  to  verify  this  formally.  At  the  same 
time  it  is  a  good  example  of  the  role  of  symbolism  (and  notation)  in  mathematics — 
it  can  be  suggestive — and  sometimes  leads  to  true  results  that  might  otherwise  be 
overlooked. 
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IfF  is  the  velocity  of  a  gas  then  div(F)  represents  the  rate  of  expansion  (or  com¬ 
pression)  per  unit  volume.  The  divergence  theorem  states  that  the  total  expansion 
(or  contraction)  over  a  region  U  equals  the  total  inflow  (or  outflow)  across  the  bound¬ 
ary.  Very  little  physical  intuition  is  required  in  order  to  accept  this  as  reasonable.  This 
physical  interpretation  is  responsible  for  the  terminology  “divergence”. 

We  now  formally  state  the  divergence  theorem. 

Theorem  15.1  (Gauss’  Divergence  Theorem)  Suppose  S  is  an  oriented  surface  in 
M3,  with  outward  pointing  normal,  enclosing  an  open  set  U  and  F  is  a  smooth  vector 
field  on  U  (=  U  U  S)  then 

//F=///div,r)' 

s  u 

An  initial  proof  uses  Green’s  theorem  and  Fubini’s  theorem  on  cross-sections  of 
U  and,  as  noted  in  earlier  chapters,  this  places  certain  geometrical  restrictions  on 
U  and  S.  From  the  previous  chapter  we  see  that  it  suffices  to  have  U  bounded,  top 
and  bottom,  above  and  below,  front  and  back  by  graphs  of  functions  which  form  the 
boundary  S.  This  covers  many  examples  involving  classical  Euclidean  shapes. 

Further  examples,  for  instance  the  annulus  0  <  r\  <  x2  +  y2  +  z2  <  r 2  <  00, 
are  obtained  by  dividing  the  open  set  into  a  finite  number  of  sets  on  each  of  which 
the  initial  proof  applies  (we  refer  to  our  discussion  on  Green’s  theorem  in  Chap.  9 
for  further  details).  In  general,  the  boundary  will  be  composed  of  a  finite  number  of 
distinct  simple  surfaces. 

The  collection  of  sets  to  which  the  divergence  theorem  applies  is  quite  large  but 
difficult  to  formulate  precisely  without  recourse  to  further  concepts  from  differential 
geometry.  For  this  reason  we  have  carefully  avoided  proving  the  theorem  and  giving 
detailed  hypotheses  on  U  and  S  in  the  statement  of  the  theorem.  The  theorem,  as 
stated,  is  sufficient  for  the  examples  we  consider. 

Example  15.2  In  this  example  we  use  the  divergence  theorem  to  evaluate 


/// 

v 


div(F  )dxdydz 


where  V  is  the  solid  cylinder  {(x,  y,  z)  :  x2  +  y2  <  1,  0  <  z  <  1}  and 

F(x,  y,  z)  =  (1  -  (*2  +  J2)3,  1  -  (x2  +  j2)3,  x2z2). 

The  boundary  of  V  is  composed  of  the  cylinder  {(v,  y ,  z)  :  x2  +  y2  =  1 ,  0  <  z  <  1} 
and  the  flat  discs  {(v,  y,  z)  :  x2-\-y2  <  1,  z  =  0}  and  {(v,  y,  z)  :  x2-\-y2  <  1,  z  =  1} 
(Fig.  15.1). 

The  outward  normal  on  the  curved  surface  at  the  point  (x,  y ,  z)  is  (v,  y ,  0).  On  the 
upper  and  lower  discs  the  outward  normals  are  (0,  0,  1)  and  (0,  0,  —1)  respectively. 
On  the  curved  surface  F  •  n  =  0  since  x2  +  y2  =  1  and  the  third  coordinate  of  n 
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Fig.  15.1 


is  zero,  on  the  bottom  F  •  n  =  0  since  z  =  0  and  on  the  top  F  •  n  =  x2.  By  the 
divergence  theorem 


/// 

v 


div(F  )dxdydz  = 


x2dxdy, 


x2+y2<  1 


The  double  integral  is  evaluated  by  using  the  polar  coordinates 


(r,6) 


->  (r  cos  6,  r  sin  6),  O<r<l,O<0<27T 


on  the  unit  disc  and  equals  jt/4. 

Example  15.3  We  use  the  divergence  theorem  to  evaluate 


(x2  +  y  +  z) 


aw 


where  W  is  the  solid  sphere  x2  +  y2  +  z2<  1  and  3W  is  its  boundary  oriented  out¬ 
wards.  The  unit  sphere  has  outward  normal  n(v,  y,  z)  =  (x,  y,  z).IfF  =  (/i,  /2,  fa) 
is  a  vector  field  on  W  then  the  divergence  theorem  implies  that 


jl(fi ,  fi  i  h) 

aw 


ff  (xf\  +  yh  +  zfo)dA 

dW 


dxdydz . 


(15.1) 

(15.2) 


Hence  we  can  use  any  smooth  vector  field  on  the  closed  unit  sphere  for  which 
xfi  +  yfi  5- zf 3  =  x2  +  y+Z-  This  allows  a  wide  choice  but  in  such  cases  the  simplest 
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Fig.  15.2 


and  most  obvious  option  is  usually  the  best.  Let  f\(x,  y,  z)  =  x,  fz(x,  y ,  z)  =  1  and 
/3(v,  y,  z)  =  1.  Then 


+  y  +  z)dA  = 


dw 


Idxdydz 


4 

=  Vol  (  W)  =  -71. 


Example  15.4  In  this  example  we  verify  a  particular  case  of  the  divergence  theorem. 
Let  U  denote  the  set  defined  by  the  inequalities  z  >  0,  X2  +  y2  +  Z2  <  4  and 
v2  +  y2  <  z2.  These  define  the  set  of  points,  above  the  vy -plane,  contained  in  the 
sphere  of  radius  2  centred  at  the  origin,  which  lie  within  the  cone  x2  +  J2<Z2 
(Fig.  15.2). 

The  boundaries  of  the  solid  sphere  and  cone  are  given  by  the  equalities  x2  +  y2  + 
z2  =  4  and  x2  +  y2  =  z2  respectively.  These  intersect  in  a  curve  which  satisfies  both 
equations,  i.e.  x2  +  y2  +  z2  —  4  =  z2  +  z2 .  Hence  z  —  =b \fl  and  x2  +  y2  =  2.  This 
is  a  circle  of  radius  \fl  parallel  to  the  vy-plane,  \J 2  units  above  it  with  centre  at  the 
point  (0,  0,  \/2).  The  open  set  U  is  that  portion  of  the  cone  which  lies  between  the 
planes  z  =  0  and  z  =  V2  capped  by  the  top  of  the  sphere  (Fig.  15.3).  The  boundary 
of  U  consists  of  the  portion  Si  of  the  cone  defined  by  x2  +  y2  =  z2, 0  <  z  <  \/2,  and 
the  portion  of  the  sphere,  S2,  defined  by  x2  +  y2  +  z2  =  4,  \fl  <  z  <  2.  Consider 
the  vector  field 

F(x,  y,  z)  =  (xz,  yz,  x2  +  y2). 

9  9  9 

We  have  div(F)  =  — (xz)  H - (yz)  H - (x2  +  y2)  =  2 z.  To  evaluate 

dx  9y  dz 


III 

u 


Izdxdydz 


we  use  spherical  polar  coordinates  (Example  10.5): 
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Fig.  15.3 


w:  (r,  6,  x/s) 


->  (r  sin  0  cos  x/r,  r  sin  6  sin  x/r,  r  cos  0). 


From  Fig.  15.3  the  domain  of  w  is 


Tt 


0  <  0  <  — ,  0<r<2,  0  <  xlr  <  2n. 

4  r 


Since  the  rows  of 


w  = 


sin  0  cos  xj/ 
r  cos  6  cos  xf/ 
—r  sin  6  sin  x)r 


sin  0  sin  xj/ 
r  cos  0  sin  x/r 
r  sin  0  cos  xj/ 


are  easily  seen  to  be  mutually  orthogonal 


det(w  )  =  1  •  r  •  r  sin  6  =  r  sin  9. 


Hence 


11! 

U 


Izdxdydz  = 


r  sin  6  •  r  cos  OdrdOdx/z 


*2  r2n 

r3dr  •  /  dx/z 


=  2tt 


0 


( 


ro 

—  cos  20 


) 


0 

71/4  16 

=  2n  •  — 

o  4 


1 

2 


-  =  47T. 


To  verify  the  divergence  theorem  we  must  show 
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F  =  4  it 


where  Si  and  S2  both  are  oriented  outwards.  To  ensure  that  we  do  not  take  an  incorrect 
sign  we  use  the  formula 


jj  (F  ■  n)  J~EG 


S  5 


to  evaluate  the  surface  integrals  where  n  is  the  outer  normal.  Since  Si  is  part  of  the 
level  set  x2  +  y2  —  z2  =  0  its  unit  normals  are 


± 


(2x,  2 y,  -2 z) 

/  \  1/2 

(( 2v )2  +  (2y)2  +  (2z)2) 


(x,  y,  -z) 

V2z 


On  Si,  z  >  0  and  from  Fig.  15.3  the  outer  normal  has  a  negative  z-component  and 
hence 


niU,  y,  z) 


(x,  y,  -z) 
\f2z 


Hence 


F  ni  =  (xz,  yz,  x2  +  y2) 


(x,  y,  -z)  x2z  +  y2z  -  z(x2  +  y2) 


\[2  z 


\[2  z 


=  0 


and 


Since  S2  is  part  of  the  sphere  of  radius  2  centred  at  the  origin  the  outer  normal  at 

(x,  y,  z)  is  (x,  y,  z)/2.  Hence 


F  n2  =  (xz,  yz,  x2  +  y2)  • 


(x,y,z) 

2 


v2z  +  y2z  +  (x2  +  y2)z 

2 


(■ x 2  +  y2)z . 


We  parametrize  the  sphere  using  spherical  polar  coordinates  (Example  10.5)  and 
obtain 

g :  ( 0,\/r ) - >  (2  sin  0  cos  x/r,  2  sin  6  sin  x/r,  2  cos  6). 


From  Fig.  15.3,  the  domain  of  g  is  0  <  0  <  7t/4,  0  <  x/r  <  2n .  We  have  already 
seen  in  Example  1 1.3  that 


II go  x  gf 


ZEG  -  F2  =  22  sin  6 
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and  from  the  above 


F  •  112  =  4  sin2  0  •  2  cos  0  =  8  sin2  0  cos  0. 


Hence 


32  sin3  6  cos  OdOch/f. 


If  u  =  sin  0  then  du  =  cos  OdO  and 


and 


=  6471  • 


1 

-  =  471 
4 


///div(r) 

u 


F. 


We  have  this  verified  the  divergence  theorem. 

Hopefully  we  have  developed,  in  the  last  few  chapters,  certain  skills  in  handling 
diagrams,  parametrizations  and  linear  algebra  and  seen  that  problems  in  integration 
theory  are  handled  by  a  mixture  of  techniques  and  ideas.  With  some  practice  readers 
will  find  their  own  preferred  approach  and,  indeed,  recognise  that  there  are  alternative 
approaches  to  a  number  of  problems.  Our  next  example  is  similar  but  somewhat  more 
complicated  than  the  previous  example.  We  could  take  the  same  approach  but  instead 
use  a  few  different  ideas  (some  call  them  tricks)  which  may  be  useful  elsewhere.  This 
example  also  highlights  the  relationship  between  the  change  of  variables  for  triple 
integrals  and  parametrizations  of  the  boundary  surfaces.  Recall  that  we  obtained 
many  of  the  change  of  variables  formulae  for  solids  by  ‘‘filling  in”  parametrization 
of  boundary  surfaces  (Example  14.4).  In  the  next  example  we  see  that  restricting 
parametrizations  of  solids  to  the  boundaries  leads  to  parametrizations  of  surfaces. 

Example  15.5  In  this  example  we  verify  the  divergence  theorem  on  a  part  of  a  solid 
torus  (Example  11.1).  Consider  the  region  A  in  the  yz-plane  determined  by  the 
inequalities  (y  —  b )2  +  z2  <  a  and  b  —  y  <  z  <  y  —  b  where  b  >  a  >  0.  The 
boundary  of  A  consists  of  an  arc  of  a  circle  of  radius  a  with  centre  (b,  0)  and  two 
straight  lines  which  pass  through  the  centre  of  the  circle.  These  are  easily  sketched, 
Fig.  15.4,  and  we  will  constantly  refer  to  this  simple  diagram  as  it  contains  a  great 
deal  of  information. 

If  we  revolve  A  about  the  z-axis  we  obtain  a  wedge-shaped  portion  U  of  the  solid 
torus  (see  Fig.  15.4  for  a  sketch  of  the  torus).  We  parametrize  the  solid  torus  by  filling 
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Fig.  15.4 

in  the  toroidal  polar  coordinates  given  in  Example  11.1  and  use  Fig.  15.4  to  find  the 
domain  of  the  parametrization  P  of  the  wedge  U . 

We  obtain 

P:  (r,  6,  x/r) - >  (( b  +  r  cos  0)  cos  x/r,  (b  +  r  cos  0)  sin  xfr,  r  sin  0) 

where  0  <  r  <  a,  —tt/4  <  0  <  it/ 4  and  0  <  x/r  <  2n .  Consider  the  vector  field 

F(x,  y,  Z )  =  (x  ^(x2  +  y2'^1  -bj,y  ^(x2  +  y2^j  1  -bj,z  (x2  +  y2^  ^ 

In  Cartesian  coordinates, 

div(F)  =  ( v 2  +  y2)1^2  —  b  +  x  •  -  •  2x(x2  +  y2)~1^2  +  (x2  +  y2)1^2  —  b 

+  y-\-  2 J (*2  +  J2)“1/2  +  (x2  +  y2)l/2 

=  3(x2  +  y2)1^2  —  2b  +  (x2  +  y2)  •  (x2  +  y2)~1^2 
=  4(x2  +  y2)l/2  —  2b 

and,  in  toroidal  polar  coordinates, 

div(F)  =  4^(b  +  r  cos  0)2  cos2  xjs  +  (b  +  r  cos  0)2  sin2  xj/ 

=  4 (b  +  r  cos  0)  —2b  =  2 (b  +  2 r  cos  0). 


-2b 
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Since 


P'  =  (Pr,  Pe,  P+) 


cos  0  cos  x/r 
cos  0  sin  x/r 
sin0 


— r  sin  0  cos  xj/ 
—r  sin  0  sin  x/r 
r  cos  0 


—  {b  +  r  cos  0)  sin  xfr 
(b  +  r  cos  0)  cos  xj/ 
0 


and  Pr ,  P$  and  P f  are  easily  seen,  by  inspection,  to  be  mutually  perpendicular  we 
have 


det^)!  =  1  •  r  •  (b  +  r  cos  0). 


Hence 


u 


* a  p7i  /  4  p2jr 


div(F)  = 


2  (b  +  2  r  cos  0)r  (b  +  r  cos  0)drd0dxj/ 


0  J-7T/4J0 
(r)  (0)  (V^) 

■7t/4  pa 

=  4n 


=  4tv 


(b2r  +  3r2/?  cos  0  +  2 r3  cos2  0)drd0 

7r/4  J  0 

^/4  / 


—  7T/4 


+  ba 3  cos^  + 


a4  1  +  cos  20 


) 


47r02a2  7 r  o  . 

- •  — h  47r/?aJ(sin0) 


2  2 

717/4  47T^4 

+^(0+ 

— 7T  /  4  4 


(' 


J0 

sin  20 


) 


=  Tt2b2a2  +  4\[2jibcr)  4 - b  Tta4. 


n/4 

—n/4 


To  verify  the  divergence  theorem  we  calculate  the  integral  over  the  boundary  of  U 
oriented  outwards.  The  boundary  of  U  is  obtained  by  revolving  the  boundary  of 
A  about  the  z-axis.  From  Fig.  15.4  we  see  that  the  boundary  of  A  consists  of  an 
arc  B 1  of  a  circle  which  revolves  into  a  part  of  the  torus  and  two  straight  lines, 
B2  and  #3,  which  revolve  into  portions  of  cones.  We  denote  by  S /  the  outwardly 
oriented  surface  obtained  by  revolving  Bi  about  the  z-axis,  i  =  1,  2,  3.  We  could 
proceed  to  parametrize  each  of  these  surfaces  using  the  methods  developed  earlier, 
but  instead  we  obtain  our  parametrizations  by  appropriate  restrictions  of  P.  For 
example  B\  is  the  boundary  of  A  obtained  by  letting  r  =  a  in  the  parametrization 
(r,  0)  — >  (b  +  r  cos  0,  r  sin  0)  of  A  and  if  we  let  r  =  a  in  P(r,  0,  xfr)  we  obtain  a 
parametrization  /  of  Si  given  by 


/(0,  xfr)  =  ({b  +  a  cos  0)  cos  xfr,  (b  +  a  cos  0)  sin  x/r,  a  sin  0) 


where  —  7r/4  <  0  <  7r/4  and  0  <  x/r  <  2tt  .  Since  partial  derivatives  are  calcu¬ 
lated  by  fixing  all  except  one  of  the  variables  we  have  fe(0,  xj/)  =  Pg(a,Q,  x/r)  and 
fy(0,  xfr)  =  P^(a,  0,  x/r).  Thus  the  partial  derivatives  of  /  are  the  final  two  columns 
of  P'  and  as  the  columns  of  P’  are  mutually  perpendicular  the  first  column  is  parallel 
to  the  normal  of  S\.  Since  the  first  column  of  P’  is  a  unit  vector  it  follows  that  the 
unit  normals  to  Si  are 
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d=(cos  0  cos  x/s,  cos  0  sin  xj/,  sin  0). 

If  0  =  0  and  xj/  =  7r/2  we  get  the  point  (0,  b  +  a,  0)  in  S i  with  unit  normals  at 
this  point  given  by  ±(0,  1,0).  From  Fig.  15.4  the  outward  normal  is  (0,  1,0).  Hence 
the  outward  normal  at  any  point  of  Si  is  given  by  (cos  0  cos  x/r,  cos  0  sin  x/r,  sin  0). 
Again,  using  orthogonality  of  the  columns  of  P  we  see  that 

II  fe  x  fir\\  =  ll/ell  •  II  II  =a(b  +  a  cosd). 

On  Si,  x2  +  y2  =  (b  +  a  cos  6)2  cos2  \js  +  (b  +  a  cosd)2  sin2  if  =  (b  +  a  cosd)2 
and 

F  =  (a  (b  +  a  cos  0)  cos  xjr  cos  0,  a  (b  +  a  cos  0)  sin  x/s  cos  0,  a  sin  0  (b  +  a  cos  0)) 
=  a(b  +  a  cos0)(cos  xj/  cos  6,  sin  xfr  cos  9,  sinO) 

=  a(b  +  a  cos0)n. 

Hence 


F  = 


Si 


JJ  F  •  n  ||  fe  x  ff\\d6df 


Si 


>2n  nil/ A 


0  J—n/A 
0/0  ( 0 ) 

*7r/4 


a(b  +  a  cos  0) n  •  a{b  +  a  cos  6)nd\lrd6 


=  2ttu2 


=  2ttci2  • 


(b2  +  2(2/?  cos  0  +  a2  cos2  0)d0 


—  7T I A 

b2  7X  3 

- 1-  Ana^bismO) 


71  ^  4  f  1  +  cos  20 

-\~  2tc  cl  /  - dO 

-n/A  J-jt/A  2 


=  7 r2a2b2  +  Ana2bV2  +  na4 h 

2  2 

=  Tt2a2b2  +  A\f2ncv)b  4 - F  7ra4, 


7i  27ra4  /cos  20  \  l77"^4 


^cos  20  ^ 


—n/A 


The  boundary  B 2  of  A  is  obtained  by  letting  0  =  7r/4  in  the  parametrization  (r,  0)  — > 
(Z?  +  r  cos  0,  r  sin  0)  of  A  and  hence  a  parametrization  of  S2  is  obtained  by  letting 
0  =  7r/4  in  P(r,  0,  xjr).  This  gives  the  parametrization 


— ) 

•JV 

with  domain  0  <  x//  <  2tt ,  0  <  r  <  a.  As  before  the  partial  derivatives  of  g  are 
obtained  from  the  first  and  third  columns  of  P\  with  0  =  7t/4,  and  again,  using 
orthogonality  of  the  rows  of  P',  we  see  that  the  unit  normals  to  S2  are  obtained  by 


g(r,  xlr)  =  [[b  + 


0 
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normalising  the  second  column  of  P'.  The  normals  are  ±(cos^,  sin  x/s,  — 1)/a/2. 
From  Fig.  15.4  the  outer  normal  has  a  positive  final  coordinate  and  thus  the  outer 
normal  at  the  point  g(r,  x[r)  is  (—  sin  —  cos  x/s,  1) /a/2.  Figure  15.4  also  shows  that 
5*2  is  part  of  the  cone  v2  +  y2  =  (z  +  b)2.  On  S2, 


(. x 2  +  y2)112  -b  =  b+  -t—-b  = 

n/2 


V2' 


Hence 

F «,.  *))  =  (("  +  4)  •  7  (" +  -^)  ***■  7 4(<" +  7) 


F7) 


r  \  r 


72/  72 


72  V  77 

(cos  x/r,  sin  x/f,  1) 


Hence 


F  •  n  =  (6  +  — ) 


1 


72  72  72 


(—  sin2  xj/  —  cos2  xj/  +  1)  =  0 


and 


F  =  0. 


s2 


The  surface  S3  is  obtained  by  letting  0  =  —it/ 4  and  we  obtain  in  the  same  way  a 
parametrization 


h :  (r,  xj/) 


with  outer  normal  (  — 


checked  that 


( 


cos  xj/  sin  xj/ 

72 


7 


at  the  point  h(r,  x//).  It  is  easily 


F  (7i(r,  i/O)  =  I  b  +  -^=  )  •  — —  (cos  x/r,  sin  xj/,  —  1) , 


72/  72 


and 


F  •  n  =  (b  H - —  |  — =  •  — =  f—  sin2  xLr  —  cos2  xLr  +  l)  =  0. 

V  a/2/  a/2  a/2  V  2 


Hence  ff  F  =  0.  We  have  thus  shown 

s3 


div(F)  = 


F  + 


F 
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and  verified  the  divergence  theorem. 

This  completes  our  programme  on  integration  theory.  We  have  concentrated 
on  motivating  the  basic  definitions,  the  concept  of  orientation,  the  main  theorems 
(Stokes’  Theorem  and  the  Divergence  Theorem)  and  geometrical  interpretations.  We 
have  neglected  mentioning  practical  and  theoretical  applications  of  several-variable 
integration  theory.  The  methods  we  have  covered  are  not  trivial  and  need  time  to 
be  mastered.  We  urge  the  reader  to  have  patience  and  to  keep  revising  until  they 
appear  obvious.  Further  studies,  of  a  pure  and  applied  nature  should  then  be  highly 
rewarding.  The  remaining  chapters  of  this  book  are  devoted  to  the  geometry  of 
surfaces  in  M3.  The  examples  of  surfaces  that  arose  in  integration  theory  are  both 
concrete  and  representative  and  should  be  referred  to  frequently  as  a  means  of  appre¬ 
ciating  and  understanding  the  abstract  concepts  we  meet  in  the  final  three  chapters. 


Exercises 

15.1  Prove  the  divergence  theorem  for  the  domain 

U  =  {(x,  y,  z)  :  0  <  x  <  1,  0  <  y  <  1,  0  <  z  <  1}. 

15.2  Use  the  divergence  theorem  to  evaluate 


F 


where  F(x,  y,  z)  =  (xy2,  x2y,  y)  and  S  is  the  surface,  oriented  outwards,  of 
the  cylinder  x2  +  y2  <  3  bounded  above  by  the  plane  x  +  z  =  0  and  below  by 
the  plane  z  =  0. 

15.3  Evaluate 


III 


xdxdydz 


u 


where  U  is  the  tetrahedron  bounded  byx>0,y>0,z>0  and  the  plane 

x  y  z 

-  +  f  +  -  =  1. 

a  b  c 


15.4  Evaluate  directly  and  also  using  the  divergence  theorem 


yz2e  xyzdxdydz. 


0<v,y,z<l 
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15.5  Use  the  divergence  theorem  to  prove 


(x2,  —y2,  3 xz)  =  37 r 


where  S  is  the  outwardly  oriented  boundary  of  the  region 

{(*,  y,  z)  :  X  >  0,  y  >  0,  2  >  0,  x2  +  y1  +  z2  <  4}. 


15.6  Let  S  denote  a  closed  surface  which  bounds  the  open  set  V  in  M3  and  let  n  denote 
the  outward  normal  to  S.  If  /  and  g  have  continuous  first-  and  second-order 


partial  derivatives  and  V'/  = 


,2  r  .  9'/  .  9Z/ 


+ 


+ 


dx2  dy 2  3 z2 


show  that 


r<*8  9/ 

- 8 t 

3n  3 


0 


/v2g  -  gV2/)  dxdydz. 


v 


15.7  Let  ^2  denote  the  region  bounded  by  the  rays  6  =  a  and  0  =  b  and  the  curve 
r  =  f  (6).  Show  that 


1  fb 

Area  (^2)  =  -  f2(0)d0. 

2  Ja 

Sketch  the  curve  r  —  1  +  cos  0  and  show  that  it  encloses  a  region  with  area 
3tt/2. 

15.8  Find  the  area  between  the  plane  z  =  0,  the  paraboloid  z  =  ax2  +  by 2  and  the 
cylinder  x2/c2  +  y2 /d2  =  1. 


Chapter  16 

Geometry  of  Surfaces  in  M3 


Summary  Using  normal  sections  we  define  normal  curvature,  principal  curvatures 
and  Gaussian  curvature.  Geometric  interpretations  and  a  method  of  calculating  the 
Gaussian  curvature  using  parametrization  are  given. 

Our  geometric  study  of  surfaces  in  M3  is  motivated  by  some  very  simple  basic  ques¬ 
tions  such  as;  what  is  curvature  and  how  does  one  measure  it?  Is  there  any  relationship 
between  surface  area  and  curvature?  What  is  the  shortest  distance  between  two  points 
on  a  surface?  We  start  by  taking  an  intuitive  and  non-rigorous  look  at  an  apparently 
very  special  case  and  this  leads  us  to  mathematical  concepts  which  are  both  useful 
and  natural.  The  surface  we  study  is  one  with  which  we  are  already  familiar  and 
this  simple  example  gives  us  everything.  We  have  already  used  all  the  techniques 
that  we  require,  and  all  the  facts  that  we  need  are  known  to  us — we  just  have  to 
look  at  things  in  a  slightly  different  way.  The  surface  S  we  consider  is  the  graph 
of  the  smooth  function  /  where  f(x,y )  is  the  height  above  sea  level.  We  suppose 
/  has  a  local  maximum  at  (vo,  Jo)  and  we  study  S  near  p  =  (vo,  yo,  /(To*  yo))- 
Since  the  point  (vo,  Jo)  is  a  critical  point  of  /  the  tangent  plane  to  S  at  p  is  the 
plane  through  the  point  (vo,  yo,  fix  o,  jo))  parallel  to  the  vy-plane  in  M3.  The  unit 
normals  to  the  surface  at  p  are  ±(0,  0,  1)  and,  for  convenience,  we  choose  (0,  0,  1) 
as  our  unit  normal  n (p).  The  tangent  plane  of  S  at  p  consists  of  all  vectors  of  the 
form  (vi,  V2,  0). 

If  our  notion  of  curvature  is  meaningful  it  should  say  something  when  we  take 
cross-sections  of  a  surface.  For  instance,  if  we  keep  getting  circles  when  we  take 
cross-sections  we  should  not  be  surprised  if  the  surface  is  a  sphere  and  if  each  cross- 
section  is  either  a  line  or  a  plane  we  expect  the  surface  to  be  a  plane.  We  consider 
cross-sections  of  M3  through  the  point  p  which  contain  the  unit  normal  at  p ,  n(p). 
Since  cross-sections  of  M3  are  two-dimensional  this  will  cut  the  tangent  plane  and 
we  can  find  a  unit  tangent  vector  at  p,  v,  such  that  our  cross-section  has  the  form 

p  +  {xy  +  yn(p)  :  x,  y  e  M2}. 

We  identify  this  cross-section  of  M3  with  M2t  ^  by  the  correspondence 
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p  +  xv  +  yn(p)  e  M3  < — >  (x,  y)  e  M2. 

The  intersection  of  this  cross-section  of  M3  and  the  surface  S  is  a  curve  on  the  surface 
called  a  normal  section  of  the  surface  at  p  (Fig.  16.1). 

Using  this  correspondence  we  may  identify  the  normal  section  of  the  surface  with 
a  curve  T  in  Mr  passing  through  the  origin.  We  direct  T  by  requiring  (0,  1)  to  be 
the  unit  normal  at  the  origin  and  calculate  its  curvature  using  the  concept  of  plane 
curvature  in  Chap.  7.  The  curvature  of  T  at  the  origin  in  M2  will  depend  on  the 
unit  vector  v  and  the  choice  of  normal  and  we  denote  it  by  kp(y).  We  call  kp(v) 
the  normal  curvature  of  S  at  p  in  the  direction  v.  By  examining  kp(\)  as  v  ranges 
over  all  unit  tangent  vectors  to  S  at  p  we  hope  to  draw  conclusions  on  the  shape 
of  the  surface  near  p.  For  example,  if  kp(v)  =  1/r  for  all  v  then  we  might  expect 
the  surface  to  approximate  a  sphere  of  radius  r  near  p.  All  normal  sections  are 
paths  on  the  surface  leading  to  the  point  p  (i.e.  to  the  top  of  the  mountain)  and  the 
different  normal  curvatures  distinguish  between  the  steep  and  the  not  so  steep  paths. 
Visualising  circles  of  different  radii  going  through  p ,  with  p  as  their  highest  point, 
gives  us  an  idea  of  the  shape  of  the  surface  near  p.  Of  course  it  is  very  laborious  to 
examine  all  these  curves  and  to  calculate  all  their  curvatures  so  instead  we  examine 
them  collectively  to  see  if  any  particular  features  of  the  set  of  all  normal  curvatures 
captures  the  essence  of  the  shape  near  p. 

Since  /  has  a  local  maximum  at  (xo,  Jo)  and  n (p)  =  (0,  0,  1)  all  normal  circles 
of  curvature  will  lie  on  the  same  side  of  the  tangent  plane  and  on  the  opposite  side 
to  the  normal.  This  means  that  all  normal  curvatures  will  be  negative.  Choosing 
(0,  0,  —  1)  as  unit  normal  at  p  changes  the  signs  of  all  normal  curvatures.  If  we  were 
examining  a  local  minimum  at  (xo,  jo)  and  n (p)  =  (0,0,  1)  we  would  have  found 
that  all  normal  curvatures  were  positive. 

But  changing  the  normal  or  turning  the  surface  upside  down — this  is  equivalent 
to  changing  local  maxima  into  local  minima  and  conversely — does  not  affect  the 
shape  of  the  surface  in  any  way  and  we  conclude:  if  all  normal  curvatures  are  strictly 
positive  or  all  are  strictly  negative  then  the  shape  of  S  near  p  is  similar  to  an  ellipsoid 
(Fig.  16.2).  These  points  are  called  elliptic  points  of  the  surface.  We  can  extend 
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this  analysis  to  arbitrary  critical  points  and  clearly  if,  at  a  point  p ,  normal  curvature 

takes  both  positive  and  negative  values  then  the  surface  near  p  is  similar  in  shape  to  a 

saddle  point.  Such  points  are  called  hyperbolic  points  of  the  surface.  Our  preliminary 

investigation  has  shown  that  information  on  the  shape  of  the  surface  can  be  obtained 

from  the  range  of  values  taken  by  kp  (v)  as  v  varies  over  the  set  of  unit  tangent  vectors 

at  p.  Since  the  range  is  determined  by  the  extremal  values  we  let  k\ (p)  =  max  kp  (v) 

IMI=l 

and  (p)  —  min  kp  (v)  and  call  k\ ( p )  and  k^  (p)  the  principal  curvatures  at  p.  If  the 

1 1 V 1 1  =  1 

maximum  or  minimum  occurs  at  a  vector  v  we  call  v  a  principal  curvature  direction. 

We  consider  the  various  possibilities  that  may  arise  for  k\  ( p )  and  k^  (p) .  If  k\(p)  = 
k2(p)  then  we  call  p  an  umbilic point  and  if  k\  (p)  =  k2(p)  =  0  we  call  p  diflat  spot. 
If  p  is  umbilic  then  kp(\)  =  k\(p)  for  all  v,  ||v||  =  1,  and  if  k\(p)  ^  0  then  the 
surface  near  p  is  similar  to  a  portion  of  a  sphere  of  radius  \/\k\(p)\.lf  pis  a.  flat  spot 
then  all  normal  curvatures  are  zero  and  the  surface  near  p  is  very  flat  and  almost  like 
a  part  of  a  plane.  At  an  umbilic  point  all  directions  are  principal  curvature  directions. 

We  now  consider  non-umbilic  points,  i.e.k\(p)  >  k2(p).1fk2(p)  >  0  then  for  any 
v,  || v ||  =  1,  we  have  0  <  k2(p)  <  kp(y)  and  if  k\(p)  <  0  then  kp(x)  <  k\(p)  <  0 
for  all  v,  ||  v ||  =  1 .  Hence,  in  both  cases,  all  normal  curvatures  have  the  same  sign  and 
near  p  the  surface  is  shaped  like  an  ellipsoid  at  one  of  its  extreme  points  (i.e.  like  the 
surface  ( x/a )2  +  (y/b)2  +  ( z/c )2  =  1  at  the  point  (0,  0,  c) ).  If  k2(p)  <  0  <  k\(p) 
then  the  surface  near  p  is  shaped  like  a  saddle  point. 

To  summarise  our  conclusions  more  concisely  we  introduce  Gaussian  curvature. 

Definition  16.1  The  Gaussian  curvature,  K(p ),  at  a  point  p  on  a  surface  S ,  is  the 
product  of  the  principal  curvatures,  k\(p)k2(p). 

We  have  noted  already  that  kp  (v)  depends  on  the  choice  of  normal — changing  the 
normal  changes  the  sign  of  kp  (v) — but  since  two  changes  of  sign  cancel  one  another 
it  follows  that  K(p)  does  not  depend  on  the  choice  of  normal. 

Although  we  have  only  considered  critical  points  on  the  graph  in  our  analysis  we 
will  see  shortly  that  our  analysis  applies  to  all  points  on  a  surface  and  for  this  reason 
we  state  the  following  result  in  its  full  generality. 
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Proposition  16.2  At  a  non-umbilic  point  on  a  surface  S  in  M3  we  have: 

K(p )  >  0  -<=>  near  p,  S  is  shaped  like  an  ellipsoid , 

K(p)  <  0  -<=>-  near  p,  S  is  shaped  like  a  saddle  point , 

K(p)  =  0  -<=>-  near  p,  S  is  shaped  like  a  cylinder  or  cone. 

At  an  umbilic  point  K(p)  >  0  and 

K(p)  >  0  -<=>  near  p,  S  is  shaped  like  a  sphere , 

K(p)  =  0  -<=>  near  p,  S  is  very  flat. 

Consider  again  the  graph  of  a  function  /  near  a  critical  point  (xo,  Jo)  -  We  suppose 
for  simplicity  that  (vo,  yo)  =  (0,  0)  and  /( 0,0)  =  0.  The  tangent  plane  at  p  — 
(*0,  jo,  /( 0,  0))  =  (0,  0,  0)  is  the  vy-plane  and  consists  of  the  points  {(vi,  v2,0)  : 
vi,  V2  G  R},  and  n  =  n (p)  =  (0,  0,  1)  is  our  choice  of  unit  normal.  Let  v  = 
(vi,  V2,  0)  be  a  fixed  unit  vector.  The  section  of  M3  determined  by  n  and  v  is  the 
plane 

{(tv\,  tV2,  s)  :  t,  s  €  R}  (16.1) 

and  the  graph  of  /  is  the  set 

{(*,  J,  f(x,  j))  :  (x,  j)  €  domain(f)}.  (16.2) 

The  points  which  satisfy  both  (16.1)  and  (16.2)  form  the  normal  section  of  the  surface 
through  p  defined  by  v  and  are  easily  seen  near  p  to  have  the  form 

{{tv i,  tv 2,  f{tv i,  tv 2)),  t  e  [-a,  a]}  =  {tv  +  f(tv  1,  tv 2)11  :  t  e  [-a,  a]}  (16.3) 

for  some  positive  a. 

On  identifying  the  plane  in  M3  spanned  by  v  and  n  with  M~x  ^  we  see  that  the 
normal  section  can  be  parameterised  by 

P(t)  =  (t,  f(tv  1,  tv2)),  t  G  [-a,  a], 

and  may  be  identified  with  the  graph  of  the  function  h\  [—a,  a]  R,  h(t)  = 
f(tv  1,  tv 2).  If  we  parametrize  the  graph  of  h  by  t  — >►  (f,  h(t))  we  obtain  (1,  0)  as 
unit  tangent  at  (0,  0)  and  (0,  1)  as  unit  normal  (see  Chap.  7).  Hence  the  curvature  of 
the  graph  at  the  origin,  directed  by  this  parametrization,  is  the  normal  curvature  of 
S  at  p  in  the  direction  v.  By  Example  7.1  the  curvature  at  (t,  h(t))  is 

ti\t) 

(1  +fc'(f)2)3/2' 
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By  the  chain  rule 

h\t)  =  v\fx(tvi,tv2)  +  v2fy(tvi,tv2) 


and 

h"(t)  =  fxx(tv  1,  tV2)v  1  +  2fxy(tv\,  tV2)v  1V2  +  fyy(tv\,tv2)v\. 

On  letting  t  =  0  we  obtain 

MV)  =  fxx(0,  0)Vj  +  2fxy (0,  0)V1V2  +  /v,y(0,  0^  =  V ///«), 0/v 

where 

„  _  (f xx  (0,0)  fXy( 0,0)  \ 

/(°’0)  V-M0’0)  /yyCO.O)/ 

is  the  Hessian  of  /  at  (0,  0).  (We  have  taken  the  liberty  of  using  v  to  denote  both 
(vi,  v2)  and  (vi,  v2,  0)  and  hope  that  this  does  not  cause  any  confusion — it  is  a 
practice  that  we  do  not  recommend  except  in  special  circumstances.) 

This  is  familiar  territory  and  we  use  our  knowledge  of  the  function  vHf( 0,0/v 
as  v  ranges  over  all  unit  vectors.  In  Chap.  4  we  showed  that  the  maximum  and 
minimum  values  of  v///(o,o/v  over  ||v||  =  1  are  the  eigenvalues  of  ///(0,0)  and  are 
achieved  at  the  corresponding  eigenvectors.  Hence  the  eigenvectors  of  ///(0,  0)  are 
the  principal  curvature  directions  and  p  is  non-umbilic  if  and  only  if  Hf( 0,0)  has 
two  distinct  eigenvalues.  Since  eigenvectors  corresponding  to  different  eigenvalues 
are  perpendicular  (Exercise  1.21)  it  follows  that  at  a  non-umbilic  point  there  exist 
precisely  two  principal  curvature  directions  which  are  perpendicular  to  one  another — 
hence  if  we  know  one  we  can  easily  find  the  other.  Note  that  we  do  not  distinguish 
between  the  directions  v  and  —  v. 

We  now  discuss  the  case  previously  omitted,  i.e.  K  ( p )  =  0,  which  corresponds  to 
a  degenerate  critical  point  of  the  function  /  .If  K(p)  =  0  and  p  is  umbilic  then  since 
K(p)  =  k\(p)k2(p)  and  k\(p)  =  k2(p)  it  follows  that  k\(p)  =  k2(p)  =  0  and  p  is 
a  flat  spot  and  we  have  already  considered  this  case.  If  K  (p)  =  0  and  p  is  not  umbilic 
then  we  have  two  possibilities,  k2(p)  <  0  =  k\(p)  and  k2(p)  =  0  <  k\(p ),  and 
which  occurs  depends  on  the  choice  of  normal.  Since  both  have  the  same  geometrical 
interpretation  we  just  consider  the  first  one.  If  v  is  the  principal  curvature  direction 
associated  with  k\(p ),  i.e.  ky(p)  =  k\(p ),  then  the  normal  section  near  p  in  the 
direction  v  is  approximately  a  straight  line.  If  w  _L  v  =  0  then  w  is  the  other  principal 
curvature  direction  and  the  normal  section  in  this  direction  is  approximately  a  circle 
of  radius  \/\k2(p)\  on  the  opposite  side  of  the  surface  to  the  normal  (Fig.  16.3a,  b). 

Figure  16.3b  suggests  a  cylinder  as  an  example  and  indeed  a  cylinder  has  Gaussian 
curvature  zero  at  all  points.  For  the  cylinder  in  Fig.  16.3c  we  have  k\(p)  =  0  and 
k2(p)  =  —  1/r.  In  general,  if  K(p)  =  0  then  all  normal  circles  of  curvature  will  lie 
on  the  same  side  of  the  tangent  plane  and  all  normal  curvatures  will  have  the  same 
sign,  that  is,  either  all  non-negative  or  all  non-positive.  The  cylinder  and  cone  (see 
Exercise  16.2)  are  typical  examples  of  surfaces  without  umbilics  and  with  K  ( p )  =  0. 
One  should,  however,  not  assume  that  every  surface  with  K(p)  =  0  is  of  this  type  as 
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Fig.  16.3 


Fig.  16.4 


Fig.  16.5 


one  can  find  quite  strange  surfaces  with  K(p)  =  0  at  isolated  points.  Our  geometric 
interpretations  are  meant  as  a  rough  guide  and  as  such  are  reasonably  useful  in 
visualising  the  shape  of  the  surface  but  do  not,  of  course,  explain  the  full  subtlety  of 
many  situations — this  requires  further  analysis. 

So  far  we  have  considered  a  rather  special  situation — a  critical  point  on  a  surface 
which  is  the  graph  of  a  function — and  it  is  time  to  see  how  representative  this  is  of 
the  general  situation. 

What  about  non-critical  points?  Well,  if  looked  at  the  right  way,  every  point  is  a 
critical  point.  For  example,  consider  the  torus  in  M3  and  take  p  as  a  typical  point 
on  the  surface  (Fig.  16.4a).  To  turn  p  into  a  critical  point  we  must  choose  a  new 
coordinate  system  for  M3 .  We  choose  our  first  two  coordinates  so  that  the  tangent 
plane  to  the  surface  at  p  corresponds  to  the  (x,  y) -plane  and  then  take  the  z-direction 
as  one  of  the  normal  directions  (Fig.  16.4b). 

We  define  a  function  0  on  the  tangent  plane  near  p  by  letting  0(v,  y)  denote 
the  distance  squared  from  the  tangent  plane  in  the  n (p)  direction  to  the  surface 
(Fig.  16.5).  This  means  that  the  surface  S  near  p  is  the  graph  of  0  and  that  0,  defined 
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on  the  tangent  space  near  p ,  has  a  local  minimum  at  p.  One  can  carry  out  a  similar 
analysis  at  any  point  on  the  surface,  although  it  is  not  so  easy  to  sketch  at  points 
which  become  saddle  points,  e.g.  the  point  q  in  Fig.  16.4a. 

We  conclude  that  our  analysis  applies  everywhere  on  a  graph.  Our  examination 
of  graphs,  level  sets,  simple  surfaces  and  surfaces  at  the  end  of  Chap.  10  shows  that 
these  objects  are  locally  equivalent  and  since  normal  curvature,  principal  curvatures 
and  Gaussian  curvature  are  locally  defined  our  results  obtained  using  graphs  are  valid 
at  all  points  on  all  surfaces. 

Gaussian  curvature  is  the  most  important  number  that  we  can  geometrically  asso¬ 
ciate  with  a  point  on  a  surface.  To  use  it  effectively  we  need  to  be  able  to  calculate 
it  directly  from  any  parametrization,  without  first  finding  normal  curvatures  or  prin¬ 
cipal  curvatures.  To  find  a  way  to  do  this  we  turn  again  to  normal  curvatures  and 
results  from  Chaps.  7  and  8  on  the  curvature  of  plane  curves.  If  P  is  a  unit  speed 
parametrized  curve  in  M2  then 

P\t)  =  T(t ),  T\t)  =  K(t)N(t),  K(t)  =  ( T\t ),  N(t)).  (16.4) 

Let  <fi  denote  a  parametrization  of  the  simple  surface  S.  Let  p  e  S,\  e 
Tp(S ),  ||  v||  =  1,  and  let  n(p)  denote  a  choice  of  normal  to  .Sat  p.  Let  P  :  [—a,  a]  — > 
T  be  a  parameterized  curve  in  S  with  P(0)  =  p  and  P\ 0)  =  v.  If  s  denotes  the 
length  function  on  T  defined  by  P  and  Q  =  P  o  s~l  then  (see  Chap.  5  and  (8.9)), 

P"it)  =  (s'(t))2Q'Xs(t))  +  s"(t)Q\s(t)). 

Since  ^'(0)  =  1  and  (^(sCO))  _L  n(/?),  (16.4)  implies 

(P"( 0),  n (p)>  =  (2"(s(0»,  n (p))  =  kp (v).  (16.5) 

Since  T  c  S  we  may  suppose 

P  :  t  e  [—a,  a]  cp(x(t),  y(t)) 

where  1 1-»-  x(t)  and  y  i->  y(t)  are  real-valued  smooth  functions  on  [—a,  a].  By  the 
chain  rule 

P\t)  =  x'{t)4>x{x(t),  y(t))  +  y'{t)(py{x{t),  y(t)) 


P"(t)  =x”(t)  ■  (px(x(t),  y(t ))  +  y”(t)  ■  (py(x(t),  y(t )) 

+  ( x’(t ))2  •  (pxx(x(t),  y(t ))  +  2 x'(t)  ■  y'(t)  ■  <pxy(x(t),  y(t)) 

+  ( y\t ))2  •  (pyy(x(t ),  y(f)). 


and 
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If  p  =  (v(0),  y(0)),  v\  =  x^O),  V2  =  /(0)  and  we  write,  for  simplicity,  <px  in  place 
Of  </)x(p),  (p  xx  in  place  of  <pxx(p\  n  in  place  of  n (/?)  etc.,  and  let  l  . —  (( pxx,  n), 
m  :=  {(pXy,  n)  =  (<pyx,  n)  and  n  :=  (</>yy,  n).  Then  P'(0)  =  v  =  vi0x  +  v20v  and, 
since  <fix  and  <fiy  are  tangent  vectors,  (0X,  n)  =  (0y,  n)  =  0.  Hence 

(P"( 0),  n (p))  =  kp(y)  -  v\{(j)xx,  n)  +  2v\ v2((pxy,  n)  +  v2{<pyy,  n) 

9  9 

=  lvl  +  2mv\V2  +  nv2 


for  any  unit  tangent  vector  vg  Tp(S).  If  v  =  v\(px  +  V2<fiy  is  any  non-zero  vector  in 
Tp(S)  then  ||v||2  =  Ev\  +  2Fv\V2  +  Gv\  and 

J  (  V\(px  +  V2(py  \  _  lv\  +  2mviV2  +  nv\  _  lv\  +  2mv\V2  +  nv\ 
H||Vl0x+v20y|9=  NP  =  Evj  +  2Fv1v2  +  Gv22  (  } 

This  is  an  extremely  useful  formula,  see  Exercises  16.4,  16.9  and  17.2. 

The  principal  curvatures  are  the  maximum  and  minimum  values  of  /v2  + 
2mviv2  +  nv \  on  the  set  v\E  +  2viV2E  +  v2G  =  1.  To  simplify  the  notation 
we  let  v\  =  x  and  v2  =  y.  Then 


and 


f{x,  y)  =  lx 2 


+  2  mxy  +  ny 


2 


g(x,  y)  =  Ex 2  +  2Fxy  +  Gy2 


0G) 

-4?  00- 


Sine tEG  —  F2  >  0  it  is  easily  verified  that  {(v,  y)  :  g(x,  y)  =  1}  is  a  compact  subset 
of  Mr  and  Vg  ^  0  on  this  level  set.  Hence  /  achieves  its  maximum  and  minimum 
on  {(v,  y)  :  g(v,y)  =  1}  and  we  may  apply  the  method  of  Lagrange  multipliers.  On 
writing  the  equation  V/(v,  y)  =  AVg(v,  y)  in  matrix  form  we  obtain 


C  :)GTT  o) C) 

and  we  may  rewrite  this  as  follows 

/  /  —  XE  m  -  XF\  /x\  _  / 0\ 
\m  —  XF  n  —  XG  )  \y  )  y0/ ’ 


By  elementary  linear  algebra  the  X  for  which  (16.8)  has  non-zero  solutions  (x,  y) 
satisfy  the  quadratic  equation 

(/  -  XE)(n  -  XG )  -  (in  -  XF)2  =  0 
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or  equivalently 

X2(EG  -  F2)  -  X(En  -  2 Fm  +  Gl )  +  In  -  m2  =  0.  (16.9) 

If  X\  and  A  2  solve  (16.9)  then,  since  K(p )  =  X1X2,  we  have  proved  the  following 
result. 

Proposition  16.3  The  Gaussian  curvature  at  a  point  p  on  a  surface  S  is 

In  —  m2 

=  Yg^f 2-  (1610) 

By  (16.7)  the  principle  curvatures  satisfy 

0  0"  0 :)(;)-© 

and  we  see  that  the  principle  curvatures  are  eigenvalues  of  a  symmetric  matrix  and  we 
recover,  using  Exercise  1 .21 ,  a  result  already  observed  using  graphs:  at  a  non-umhilic 
point  the  principal  curvature  directions  are  perpendicular  to  one  another 

The  appearance  of  EG  —  E2,  previously  encountered  while  calculating  surface 
area  in  Proposition  16.3,  suggests  a  relationship  between  Gaussian  curvature  and 
surface  area.  This  comes  to  light  in  the  Gauss-Bonnet  theorem  (Chap.  18). 

Example  16.4  We  calculate  the  Gaussian  curvature  of  the  surface  z  =  xy.  This 
surface  is  the  graph  of  the  function  fix,  y )  =  vy  and  we  obtain  a  parametrization  0 
by  letting  <p(x,y)  =  (x,y,  xy).  We  have 


fx  =  (1,0,  y),  (j)y  =  (0,  \,x) 

E  =  fx.fx  =  1  +  y2,  F  =  fx-fy  =xy,  G  =  =  1  +  v2 

EG  —  F2  =  (1  +  y2)(l  +  x2)  -  x2y2  =  1  +  v2  +  y2 


4>X  X  0  v  = 


i  j  k 

1  0  y 

0  1  v 


(-y,  -v,  1) 


0x  x  0y  _  (-y,  -v,  1) 

1 1 0x  x  0  y  1 1  (1  +  y2  +  v2)1/2 

fxx  =  (0,  0,  0),  fxy  =  (0,  0,  1),  fyy  =  (0,  0,  0) 

1 


l  —  {fxxi  —  0,  m  —  (fxyi  — 


(1  +  x2  +  y2)1/2  ’ 


n  —  (0yy ,  —  0. 
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Hence,  if  p  =  (x,  y,  xy),  then 


K(p)  = 


—m 


-1 


EG-F 2  (l+x2  +  y2)2 


Since  the  Gaussian  curvature  is  always  strictly  negative  the  surface  z  =  xy  consists 
entirely  of  non-umbilic  points  which,  looked  at  critically,  are  saddle  points. 

Although,  in  general,  it  may  be  difficult  to  find  principal  curvatures  and  principal 
curvature  directions  it  is  possible  in  special  cases  and  this  is  the  content  of  the 
following  proposition.  We  say  that  a  curve  V  on  a  surface  is  a  line  of  curvature  if  its 
tangents  are  principal  curvature  directions  at  each  point. 

Proposition  16.5  Iff :  U  C  Mr  S  is  a  parametrized  surface  then  the  coordinate 
curves  off  are  lines  of  curvature  if  and  only  if  F  =  m  =  0  at  all  non-umbilic  points. 
In  this  case,  at  non-umbilic  points,  the  principal  curvatures  are  1/ E  andn/ G. 

Proof  Fix  p  e  S  and  suppose  the  principles  curvatures  at  p  are  X\  and  X2,  X\  X\ . 
Since  the  lines  of  curvature  passing  through  p  cross  one  another  at  right  angles  it 
follows,  if  the  coordinate  curves  are  lines  of  curvature,  that  F  =  fx  •  fy  =  0.  Since 
the  vectors  (1,  0)  and  (0,  1)  solve  (16.8)  we  obtain  after  making  these  substitutions 
l  —  XE  =  m  =  n  —  XG  =  0.  In  particular,  we  see  that  m  =  0  and  note  also 

{Ai,A2}  =  {i,§}. 

Conversely,  suppose  F  =  m  =  0  at  all  non-umbilic  points.  By  (16.6) 


'P 


( 


vi  4>x  +  V2<P 


y 


II  vi  <px  +  v2(p 


y 


) 


lv\  +  nv\ 
Ev\  +  Gv^ 


when  v  =  vi0*  +  V20y  7^  0.  The  maximum  and  minimum  of  f{v\ ,  V2)  =lv\-\-nv\ 
on  the  set  Ev 2  +  Gv^  =  1  are  easily  found  using  Lagrange  multipliers  and  we  see 

that  {X\,  X2)  =  {p,  ^}.  Since  \\fx\\  =  s/~E  and  ||0V||  =  \/G  this  implies 


n 


G 


and  the  coordinate  curves  are  lines  of  curvature.  This  completes  the  proof.  □ 

Example  16.6  The  surface  obtained  by  rotating  the  graph  of  h:  (a,b)  — >►  R 
(Example  10.4)  is  parametrized  by 


P(t,  0)  =  (t,  h(t)cos0,  h(t)sin0). 

For  this  parametrization  Pt  =  (1,  hf  cos  0,  h'  sin  0)  and  Pq  =  (0,  —h  sin  0,  h  cos  0). 
Hence  E  =  1  +  ( h r)2,  F  =  0,  G  =  h2, 


Ptt  =  (0,  h"  cosO,  h'f  sinO) 
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Pto  =  (0,  —  h'  sin  6,  h!  cos  0) 


and 


Pqq  =  (0,  —hcosO,  —hsinO) 


Moreover, 


Pt  x  Pq  =  (h'h,  —hcosO,  —hsinO) 


and 


n 


(hf,  —  cos  0,  —  sin0) 

(i  +  (fto2) 1/2 


Hence 


/  = 


m 


(Ptt,  n) 

(Ae.  n) 

(fiw.n) 


—ft 


a 


(i  +  (/*02)1/2 


=  0 


A 


(1  +  (/zO2)172 


By  Proposition  16.5  the  coordinate  curves  are  lines  of  curvature  and  the  principal 
curvatures  and  Gaussian  curvature  are 


/ 


—h 


// 


n 


1 


(i  +  ( h’)2 ) 


2^3/2’  G 


h(  1  +  (h')2) 


2\!/2 


and  K  = 


—h 


// 


^(l  +  (fto2) 


An  interesting  case  occurs  when  h(t )  =  ccosh(f/c).  This  is  the  shape  assumed 
by  a  hanging  chain  and  is  called  a  catenary.  The  surface  of  revolution  is  called  a 
catenoid.  Using  the  identities 

d  d  22 

(cosh t)  =  sinhf,  — (sinhf)  =  cosh  t,  cosh  t  —  sinh  t  =  1, 


dt 


dt 


we  see  that  the  principal  curvatures  for  the  catenoid  are  ±  1  /c  cosh2  (t/c).  Hence  k\  + 
k2  =  0.  A  surface  with  this  property  is  called  a  minimal  surface.  This  terminology 
arose  in  the  following  way.  Take  a  closed  curve  in  M3  — shaped,  for  instance,  from  a 
piece  of  wire — and  place  a  bubble  over  it.  This  will  assume  a  certain  shape  in  order 
to  minimise  a  physical  quantity  on  the  boundary  called  surface  tension.  The  shape 
assumed  by  the  bubble  is  a  minimal  surface.  The  catenoid  is  the  only  surface  of 
revolution  which  is  a  minimal  surface.  On  minimal  surfaces  feCp)  =  —k\(p)  and 
hence 

K(p)  =  ki(p)k2(p)  -  -ki(p)2  <  0. 


The  quantity  2 ^  is  called  the  mean  curvature. 
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In  this  chapter  we  have  covered  a  lot  of  theoretical  and  practical  material  in  identi¬ 
fying  and  calculating  Gaussian  curvature.  In  the  next  chapter  we  will  summarise  the 
information  we  already  have  on  this  important  concept  and  discuss  further  geometric 
implications. 


Exercises 


16.1  Calculate  the  Gaussian  curvature  at  an  arbitrary  point  of  the  helicoloid  parame¬ 
trized  by 


P(t ,  6)  =  ( t  cos  0,  t  sinO,  bO) 


t  e  M,  0  e  (0,  27t)  and  b  a  non-zero  real  number. 

16.2  Find  the  principal  curvatures  and  the  Gaussian  curvature  at  an  arbitrary  point 
on  the  cone  z2  =  x2  +  y2.  Show  that  the  cone  has  no  umbilics. 

16.3  Let  S  denote  the  surface  parametrized  by  <fi{u,  v)  =  (u,  v,  u2  +  v2),  (u,  v)  e  Mr. 
Show  that  the  curve  V  parametrized  by 

?  1 

Pit)  =  0(r,  t),  --  <  t  <2, 

lies  in  S.  Find  the  unit  tangent  to  T  at  P(l)  and  find  the  (absolute)  normal 
curvature  to  S  at  P(l)  in  the  direction  P'(  1). 

16.4  If  P:  U  c  M2  — >  M3  is  a  parametrization  of  a  simple  surface  S  show  that 
a  point  p  is  an  umbilic  point  if  and  only  if  there  exists  a  real  number  a  such 
that  (l,m,n)  =  a(E,  F,  G)  where  each  term  is  evaluated  at  p.  Hence  find  all 
umbilics  on  the  surface  z  =  xy. 

16.5  Find  the  elliptic  and  hyperbolic  points  on  the  surface  parametrized  by  0  (u ,  v)  = 
(, u ,  v,  u 3  +  v3). 

16.6  Prove  that  the  surface  parametrized  by 

f(s ,  t)  =  (cos s,  2  sin  s,  t ),  0  <  s  <  2tt,  t  >  0 

has  constant  Gaussian  curvature.  Describe  this  surface. 

16.7  If  the  surface  S  is  defined  by  the  equations 

x  =  a(u  +  v),  y  =  b{u  —  v),  z  =  uv 
show  that  the  coordinate  curves  are  straight  fines. 
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16.8  Find  the  Gaussian  curvature  at  an  arbitrary  point  on  the  ellipsoid 


2  2  2 
x  yz  zz 

-T  +  TT  +  -7  =  1. 


a * 


b 2 


cA 


16.9  Show  that  a  surface  S  is  a  minimal  surface  if  and  only  if 


En  +  Gl  =  2  Fm, 


Show  that  the  level  set  ez  cos  v  =  cos  y  is  a  minimal  surface. 


Chapter  17 

Gaussian  Curvature 


Summary  We  define  the  Weingarten  mapping  or  shape  operator.  This  gives  an 

intrinsic  approach  to  Gaussian  curvature. 

We  first  recall,  from  the  previous  chapter,  the  concepts  introduced  and  the  results 

obtained  concerning  the  shape  of  a  surface  near  a  point  p.  All  these  results  were 

obtained  using  plane  curvature  in  Mr.  We  defined  or  obtained  the  following: 

(a)  kp(v),  the  normal  curvature  at  p  in  the  direction  of  the  unit  tangent  vector  v  at 
P 

(b)  principal  curvatures  at  p ,  k\(p)  and  k2(p ),  where 

k\(p)  =  max  k d (v)  and  k2(p )  =  min  kD(\) 

IMI=l  y  IM|=l 

(c)  principal  curvature  directions ,  i.e.  tangent  vectors  Vi  and  V2  such  that 

kp(y{)  =  ki(p)  for  i  =  1,  2 

(d)  umbilic  points,  i.e.  points  where  k\(p)  =  k2  (p),andflat  spots,  i.e.  where  k\(p)  = 
ki(p)  =  0 

(e)  at  an  umbilic  point  all  (tangential)  directions  are  principal  curvature  directions; 
at  a  non-umbilic  point  there  are  precisely  two  principal  curvature  directions  Vi 
and  V2  which  are  perpendicular  to  one  another 

(f)  Gaussian  curvature  at  p 


K(p)  =  ki(p)k2(p)  = 


In  —  m2 
EG  -  F2 


where  0  is  any  parametrization, 


E  —  fix  ’  fix  5  F  —  fix  ’  fiy  •>  G  —  fiy  ’  fiy 
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l  —  fxx  ’  TYl  —  (j)xy  ’  W,  Tl  —  (j)yy  '  n 

fx  X  (py 

n  =  - — 

II0JC  x  0v  II 

(h)  the  coordinate  curves  are  lines  of  curvature  if  and  only  if  F  =  m  =  0  at  all 
non-umbilic  points. 

The  Gaussian  curvature  contains  less  information  than  the  principal  curvatures, 
that  is  to  say  if  we  know  the  principal  curvatures  then  we  can  calculate  the  Gaussian 
curvature  but  from  the  Gaussian  curvature  alone  we  cannot  calculate  the  principal 
curvatures.  Thus,  at  first  glance,  it  appears  that  in  using  Gaussian  curvature  we  may 
be  neglecting  important  information. 

However,  experience  and  subsequent  results  show  that  the  information  lost  is 
generously  compensated  by  other  gains.  To  begin  with,  Gaussian  curvature  is  a  sin¬ 
gle  real  number  assigned  to  each  point  on  a  surface — the  principal  curvatures  and 
directions  involve  two  real  numbers  and  two  vectors.  Thus  Gaussian  curvature  has 
the  advantage  of  simplicity.  We  have  already  seen  that  Gaussian  curvature  may  be 
easily  calculated  from  any  parametrization  whereas  it  may  be  difficult  to  calculate 
the  principal  curvatures.  The  principal  curvatures  depend,  up  to  a  factor  ±1,  on  the 
choice  of  normal  while  Gaussian  curvature  has  the  same  value  for  any  choice  of 
normal.  In  practice  this  means  that  any  parametrization  may  be  used  to  calculate 
Gaussian  curvature  while  only  parametrizations  consistent  with  the  choice  of  nor¬ 
mal  may  be  used  to  find  principal  curvatures.  Indeed,  along  these  lines,  we  have 
a  celebrated  theorem  of  Gauss — theorema  egregium — which  asserts  that  Gaussian 
curvature  is  an  intrinsic  property  of  the  surface.  Roughly  speaking  this  says  that 
Gaussian  curvature  may  be  calculated  directly  from  functions  defined  internally  on 
the  surface  and  without  using  such  external  properties  as  the  normal  or  the  fact 
that  the  surface  lies  in  M3.  Our  method  of  calculating  K  uses  the  normal  so  Gauss’ 
theorem  tells  us  that  there  is  another  way  of  calculating  K  which  does  not  use  the 
normal.  At  first  glance  this  may  appear  a  rather  minor  point  but  it  was  this  result 
which  paved  the  way  for  the  development  of  a  very  powerful  and  a  very  general 
type  of  geometry — Riemannian  geometry — in  which  the  key  concepts  are  differ¬ 
entiation  and  the  length  of  tangent  vectors.  Thus  for  simplicity,  for  practical  and 
intrinsic  reasons,  Gaussian  curvature  has  many  advantages.  However,  in  studying 
surfaces  all  of  the  concepts  we  discussed  play  a  useful  role  and  none  should  be 
neglected. 

So  far  we  have  studied  the  shape  of  a  surface  by  examining  curves  (normal  sec¬ 
tions)  of  the  surface  but  there  are  other  intuitive  approaches  to  the  same  problem. 
Almost  invariably  they  lead  back  to  Gaussian  curvature.  For  example,  following  the 
successful  approach  to  plane  curvature  obtained  by  taking  limits  of  circles,  it  is  nat¬ 
ural  to  regard  the  reciprocal  of  the  radius  of  the  sphere  that  sits  closest  to  the  surface 
near  p  as  a  measure  of  the  curvature  at  p.  For  surfaces  the  definition  of  the  sphere 
of  closest  fit  is  not  so  obvious,  especially  at  saddle  points,  and  an  indirect  approach 
is  taken.  We  use  a  normal  n  to  project  the  surface  near  p  onto  the  unit  sphere  in  M3 
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Fig.  17.1 

and  compare  the  area  of  the  surface  near  p  with  the  area  of  its  image.  When  used  in 
this  way  n  is  called  the  Gauss  map.  If  Bs  is  the  ball  with  centre  p  and  radius  s  then 

Area  (n (S  n  Bs)) 

Area(S  Fi  Bs) 

Note  that  if  S  is  a  plane  then  n(S)  consists  of  a  single  point  and  K(p)  =  0  while  if 
S  is  a  sphere  of  radius  r  then  n(v)  =  ±.x/r  for  all  v  e  S  and  K(p)  =  1/r2.  Further 
geometric  interpretations  appear  in  the  final  chapter. 

We  list  now  a  number  of  results  on  Gaussian  curvature  which  give  some  idea  of 
its  uses;  more  advanced  and  deeper  results  are  given  later.  Think  about  these  results, 
ask  yourself  if  they  are  geometrically  plausible,  how  they  fit  in  with  your  intuition, 
what  they  say  about  the  surfaces  with  which  you  are  already  familiar,  and  how  you 
might  go  about  proving  them. 

(i)  If  S  is  a  connected  surface  in  R3  consisting  entirely  of  umbilics  then  S  is  either 
an  open  subset  of  a  sphere  or  a  plane. 

(ii)  Every  compact  surface  in  M3  contains  a  point  p  with  K(p)  >  0. 

(iii)  A  compact  connected  surface  of  constant  Gaussian  curvature  is  a  sphere. 

(iv)  Hilbert’s  Lemma:  If  p  is  a  non-umbilic  point  in  S ,  k\  has  a  local  maximum  at 
p  and  Jc2  has  a  local  minimum  at  p ,  then  K(p)  <  0. 

Example  17.1  In  this  example  we  discuss  the  torus.  From  Fig.  17.1  we  see  that  at 
the  point  p  the  surface  lies  on  one  side  of  the  tangent  plane  and  K(p)  >  0  while  at  q 
it  lies  on  both  sides  and  K(q)  <  0.  From  Figs.  17.1  and  17.2  it  is  clear  that  1  /a  will 
always  be  a  principal  curvature  and  that  1  /b  —  a  (respectively  \/b  +  a)  is  a  principal 
curvature  at  p  (respectively  q).  We  confirm  this  in  our  analysis. 

We  use  toroidal  polar  coordinates  for  our  parametrization, 

P  :  (6,  <fi)  — >  ((b  +  a  cos  0)  cos  0,  (b  +  a  cos  0)  sin0,  a  sin  0) 

0 0 ,  0)  g  (0,  27r)  x  (0,  2tt).  From  Examples  11.1  and  14.5,  E  =  a2,  F  =  0, 
G  =  (b  +  a  cos  6 )2,  and 


\K(p)\  =  lim 
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Fig.  17.2 

n  =  —  (cos  0  cos  0 ,  cos  0  sin  0 ,  sin  0) 

Pq  =  ( — a  sin  0  cos  0,  —  a  sin  0  sin  0,  acos0) 

=  (— (b  +  a  cos  0)  sin  0,  (b  +  a  cos  0)  cos  0,  0) 

Pee  =  ( — a  cos  0  cos  0,  —  a  cos 0  sin 0,  —asmO) 

Petf)  =  (a  sin  0  sin  0,  —  a  sin  0  cos  0,  0) 

=  (—(/?  +  a  cos  0)  cos  0,  —  (Z?  +  a  cos  0)  sin0,  0) . 

Hence 

9  9  9  9  9 

/  =  (P00 ,  n)  =  a  cos  0  cos  0  +  a  cos“  0  sin  0  +  a  sin  0  =  a 
m  =  (P00,  n)  =  —a  cos  0  sin  0  cos  0  sin  0  +  <2  cos  0  sin  0  cos  0  sin  0  =  0 
/I  =  (P00,  n)  =  (b  +  a  cos  0)(cos  0  cos  0  +  cos0  sin  0) 

=  (b  +  a  cos0)  cos0. 


By  Proposition  16.5  the  coordinate  curves  are  lines  of  curvature,  ^  and 

&  =  fr+C^cos  6>  are  principal  curvatures  and 


In  —  m2  aib  +  a  cos0)  cos0  cos0 

EG  —  F2  u2{b  a  cos  O')2  a(b-\- a  cos  6) 


We  now  have  a  formula  for  Gaussian  curvature  and  also  a  diagram  (Fig.  17.2)  and  we 
may  compare  and  combine  them.  By  differentiating  K(Q,  0),  which  only  depends 
on  0,  we  can  locate  the  points  of  maximum  and  minimum  Gaussian  curvature,  p 
and  q ,  and  this  confirms  what  the  sketch  tells  us.  For  the  torus  F  =  m  =  0.  Hence 
( — ci  sin  0  cos  0,  — cl  sin  0  sin  0,  cl  cos  0)  and  (—(/?  +  a  cos  0)  sin  0,  (b  +  a  cos  0) 
cos  0,  0)  are  principal  curvature  directions  and  the  principal  curvatures,  associated 
with  toroidal  polar  coordinates,  are  1  / a  and  cos  0/(b  +  a  cos  0). 

We  now  consider  a  more  intrinsic  approach  to  Gaussian  curvature.  If  F  :  S  — > 
Wl,  where  (S,  n)  is  an  oriented  surface  in  M3,  p  e  S  and  v  e  Tp(S),  let 
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DyF(p) 


d  / 


t= 0 


(17.1) 


where  P  :  [—a,  a]  — >  T  is  any  parametrized  curve  in  S  with  P(0)  =  p  and 
P'(0)  =  v.  It  can  be  shown  that  this  definition  does  not  depend  on  V  or  P .  The 
mapping  v  — >  DyF(p)  is  a  linear  mapping. 

Let  0  :  U  — >  S  denote  a  parametrization  of  a  part  of  S.  Here  we  use  (px  to 
denote  the  mapping  <p(U)  C\  S  — >  M3  given  by 


y )) 


d 

—  (0(*  +  f,y)) 

dt 


t= 0 


(17.2) 


Using  (17.1)  and  (17.2)  we  define  D^x  px ,  D^tpy,  D^ypx  and  £yv0v.  The  expres¬ 
sions  for  these  derivatives  may  initially  appear  cumbersome,  e.g.  for  instance 

(%,0x)  (0Cu  y))  =  +  s,y  +  t) 

However,  since  0  is  a  parametrization  all  its  partial  derivatives  exist  and  hence, 
using  a  two  variables  Taylor  series  expansion,  we  see  that 

{D<t>(px)((p(x,y))  =  ^-^-(x,y).  (17.3) 

dydx 

We  now  use  the  notation  pxx ,  pxy ,  pyx  and  <pyy,  respectively,  in  place  of  D(j)x  <px , 
D^^.cpy,  D(f)y px  and  <py.  By  (17.3),  <pxy  =  (j)yx  (see  also  the  introduction  to 
Chap.  4). 

The  product  rule  for  differentiation  and  (n (p),  n (/?))  =  1  imply 


(Dyn(p),  n (p))  +  (n (p),  Dvn(p))  =  0 


i.e.  (Dvn(p),n(p))  =  0  at  any  point  p  e  S  and  any  v  e  Tp(S).  Hence  Dyn(p)  _L 
n (p)  and  Dyn(p)  belongs  to  the  tangent  space  at  p.  This  allows  us  to  define  a  linear 
mapping  from  the  tangent  space  at  p  into  itself  by  letting 

Lp  :  v  e  Tp(S )  — >  -Dyn(p)  e  TP(S). 

This  important  mapping  is  called  the  Weingarten  mapping  or  shape  operator.  We  now 
suppose  that  0  is  consistent  with  the  orientation  and  write  n  in  place  of  n  (0(v,  y)). 
Since 

d 

— n(0(*  +  t,  y))\t=o  =  =  -Lppx. 

and  {(px ,  n)  =  0  we  have 

d 

0  =  —  (0*(*  +  Ly),n(0(v  +  t,y))  U=o 
dt 
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and 

(0X5  Fp(fix)  ~  (0XX5  n)  =  /. 

Similarly  w  =  (0-y,  Fp<fix)  =  {(fiXy->  n)  =  {(fiyxi  n)  =  (0*,  Fpfiy )  and  (0y,  Fpfiy)  = 

ffiyy  5  n)  —  n. 

Since  {0X ,  0V}  is  a  basis  for  the  tangent  space  at  p  we  can  find  scalars  {a,  b,  c,  d] 
so  that  Lp((px)  =  afix  +  b<fiy  and  Lp((fiy )  =  cfix  +  d(fiy.  On  solving  the  system  of 
linear  equations, 

Lp((px)  -  <fix  =  aE  +  bF  =  1 
Lp(fix)  •  <fiy  =  aF  +  bG  =  m 
F p (0y )  •  fix  =  cE  +  dF  =  m 

Lp(fiy)  *  fiy  =  CF  +  dG  =  ft 

we  see  that 

_  1  / IG-mF  mE-lF\ 

p  EG  —  F 2  \mG  —  nF  nE  —  mF  ) 

is  the  matrix  for  Fp  with  respect  to  the  basis  {0X,  0V}  for  Tp(S).  Since 

(EG  -  F2)2  det(Ap)  =  (IG  -  mF)(nE  -mF)-  (m  E  -  lF)(mG  -  nF) 

=  (EG  -  F2)(ln-m2) 

we  have  proved,  in  view  of  Proposition  16.3,  the  following  result. 

Proposition  17.2  If  0  parametrizes  a  surface  S  in  M3  then  the  determinant  of  the 
matrix  of  the  Weingarten  mapping  at  p  with  respect  to  the  basis  for  the  tangent  space 
induced  by  0  is  the  Gaussian  curvature  of  S  at  p. 

If  y  =  v\(fix  +  V20y  and  w  =  w\fix  +  wifiy  then,  by  the  above, 

( Fp(y ),  w)  =  (F p(v\(fix  H-  Vlfiy)-)  ttfi0x  H-  ^2 fiy) 

=  VltUl(£p0JC,0JC>  +  (V1W2  +v2uq)(Lp(0x),0;y)  +V2  W2(Fp((j)y),(fiy) 
=  (v,  Lp{ w)) 

and  Fp  :  7^(5)  — >  Tp(S)  is  a  symmetric  linear  operator.  Moreover, 

(Lp(y),  v)  =  lv\  +  lmv\V2  +  nv \ 


and  if  v  is  a  unit  tangent  vector  at  p  then 
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(Lp(v),  v)  =  kp{\).  (17.4) 

Since  Lp  is  symmetric  we  can  choose  an  orthonormal  basis  {ei,  e 2 }  for  Tp(S)  con¬ 
sisting  of  eigenvectors  for  Lp.  Let  f>\  and  f>2  denote  the  corresponding  eigenvalues. 
If  x,  y  e  R,  x2  +  y2  =  1  then  v  =  xe\  +  ye2  is  a  unit  tangent  vector  at  p  and,  by 
(17.4), 


kp(y)  =  (L  p(xe\  +  ye2),  xe\  +  ye2) 

=  (^iei  +  yy02e2,  xe\  +  ye2) 

=  *20i(ei,ei)  +  2xy(ei,  e2)  +  y2(e2,e2) 

=  x2f>\  +  y2/32. 

This  shows  that  the  principal  curvatures  are  the  eigenvalues  of  Lp  and  the  eigenvec¬ 
tors  are  the  principal  curvature  directions.  Since  the  matrix  for  Lp  with  respect 
to  this  basis  is  a  diagonal  matrix  with  the  eigenvalues  as  entries  we  also  have 
K(p )  =  pifo  =  det (Lp).  We  summarise  what  we  have  proved  in  the  following 
proposition. 

Proposition  17.3  At  a  point  p  in  a  surface  S  the  principal  curvatures  are  the  eigen¬ 
values  of  Lp,  the  principal  curvature  directions  are  the  eigenvectors  of  Lp,  and  the 
Gaussian  curvature,  K  (p),  is  the  determinant  ofLp  with  respect  to  any  orthonormal 
basis  for  Tp(S). 

Example  17.4  If  p  is  umbilic  then  all  normal  curvatures  are  equal.  If  p  is  non-umbilic 
and  k\,  k2,  Vi,  and  v2  are  the  principal  curvatures  and  the  corresponding  principal 
curvature  directions  then  any  unit  tangent  vector  v  at  p  has  the  form 

v  =  cos  0\\  +  sin0v2 

for  some  real  number  6.  By  Proposition  17.3 

Lp{\\)  =  ki(p)\i  and  Lp(\2)  =  k2(p)V2 

and,  as  p  is  non-umbilic,  (vi,  v2)  =  0.  By  (17.2), 

kp (v)  =  (. Lp(\ ),  v)  =  (Lp(cos6vi  +  sin0v2),  cos^vi  +  sin0v2) 

=  (cos0L^(vi)  +  sin^L^(v2),  cos^vi  +  sin0v2) 

=  k\(p)  cos2  6  +  k2(p)  sin2  6. 

This  is  known  as  Euler’ s  formula  and  shows  that  normal  curvature  in  any  direction 
can  be  recovered  from  the  principal  curvatures  and  the  principal  curvature  directions. 
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Fig.  17.3 


Exercises 


17.1  Find  the  Weingarten  mapping  for  the  torus. 

17.2  Let  S  denote  the  graph  of  the  function  f(u,v)  =  u2  —  2v2  oriented  by  the 
usual  parametrization.  Find  a  curve  V  in  S  such  that  the  normal  curvature  in 
the  tangent  direction  to  V  is  always  zero.  Hence  find  a  straight  line  that  lies 
inS. 

17.3  Suppose  the  two  oriented  surfaces  S i  and  S2  intersect  in  a  curve  V .  Let  k  denote 
the  curvature  of  V  c  M3  and  let  A;  denote  the  normal  curvature  of  V  in  Si, 
i  =  1,  2.  If  0  is  the  angle  between  the  normals  to  Si  and  S2  show  that 

k 2  sin2  6  =  X2  +  X 2  —  2X1X2  cos  6. 

17.4  Use  — Dv(n)  to  give  another  solution  to  Exercise  16.3. 

17.5  Show  that  the  average  of  the  normal  curvature  over  all  directions  is  the  mean 
curvature. 

17.6  Consider  the  plane  curve  h  described  by  the  following  geometric  condition: 
start  at  the  point  (0,  c)  and  move  so  that  the  tangent  line  always  reaches  the 
v-axis  after  traveling  a  distance  c.  From  Fig.  17.3  deduce  that 


V '  c2  —  h2 
sin# 

(Note  that  sin#  =  — h/c ,  tan#  =  h  =  =.)  Show  that 

yl—  sin2  6 


h 


a 


c2h 


( c 2  —  h2)2 
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Orient  the  surface  of  revolution,  B ,  of  this  curve  so  that  the  principal  curvatures 
are  h' /c  and  —  1  /chr .  Hence  deduce  that  B  has  constant  negative  Gaussian 
curvature.  Sketch  the  surface  B — it  may  help  you  to  know  that  it  is  called  the 
bugle  surface. 


Chapter  18 

Geodesic  Curvature 


Summary  We  define  geodesic  curvature  and  geodesics.  For  a  curve  on  a  surface 
we  derive  a  formula  connecting  intrinsic  curvature,  normal  curvature  and  geodesic 
curvature.  We  discuss  paths  of  shortest  distance,  further  interpretations  of  Gaussian 
curvature  and  introduce,  informally  and  geometrically,  a  number  of  important  results 
in  differential  geometry. 

Our  study  of  normal  curvature  was  based  on  identifying  the  normal  of  a  curve  with 
the  normal  of  the  surface.  It  is  also  possible  to  arrange  things  so  that  the  normal  to 
a  curve  in  a  surface  is  a  tangent  vector  to  the  surface.  This  leads  to  a  new  type  of 
curvature,  geodesic  curvature ,  that  we  discuss  and  interpret  in  this  chapter. 

Let  S  denote  an  oriented  surface  in  M3  with  smooth  unit  normal  n  and  let 
P  :  [a,  b]  — >  r  c  S  denote  a  unit  speed  parametrized  curve  on  the  surface.  Since  the 
tangent  space  at  each  point  on  the  surface  is  two-dimensional  and  Pf  ft)  is  a  tangent 
vector  at  Pft)  it  follows  that  there  are  precisely  two  unit  tangent  vectors  at  Pft) 
which  are  perpendicular  to  P' ft).  We  distinguish  between  them  by  using  the  normal 
to  the  surface  at  Pft),  n(P(t)),  and  define  the  surface  normal  to  r  at  P(t),  ns(P(t)) 
to  be  n (P(t))  x  T(t).  To  simplify  our  notation  we  sometimes  write  n (t)  and  ns (t) 
in  place  of  n(P(/))  and  ns (P(t)),  respectively.  By  construction 

{P'(0  =  r(0,ns(0,n(0} 

is  a  right-handed  orthogonal  system,  and  in  particular,  an  orthonormal  basis  for  M3 . 
In  Chap.  7  we  encountered  a  similar  situation  when  we  obtained  the  orthonormal 
basis  {T,  N,  B]  at  a  point  on  a  curve  r .  In  that  case  we  proceeded  to  obtain  the 
Frenet-Serret  equations  by  differentiation  and  using  properties  of  the  orthonormal 
basis.  We  follow  precisely  the  same  path  to  a  similar  end  here  and  obtain  real- valued 
functions  aft),b(t)  and  c(t )  such  that 

/T(t)\'  !  0  aft)  cft)\  (Tft)\ 

I  ns(0  I  =  (  -aft)  0  bit)  j  j  ns(0  J  .  (18.1) 

V  n(f)  /  \-c(t)  -b(t)  0  /  \  n(0  / 


S.  Dineen,  Multivariate  Calculus  and  Geometry, 

Springer  Undergraduate  Mathematics  Series,  DOI:  10. 1007/978- 1-4471 -64 19-7_1 8, 
©  Springer- Verlag  London  2014 


217 


218 


18  Geodesic  Curvature 


In  particular  we  see  that 


=  —a(t)T  it)  +  b(t)n(t)  (18.2) 

and  the  “part”  of  n's(t)  which  “lies”  in  the  tangent  space  at  P(t )  is  parallel  to  T it). 
This  equation  is  similar  to  Eq.  7.1 1  in  Chap.  7.  We  define  the  geodesic  curvature  of 
r  at  Pit),  Kg(t ),  to  be  ait).  By  (18.2) 

Kgit)  =  — (n s'(f),  Tit))  =  a{t) . 

The  entry  b(t)  is  called  the  geodesic  torsion  of  T  at  P  (, t )  and  written  rg  (, t ).  Rewriting 
Eq.  (18.2)  we  obtain 

n g(0  =  -Kg(t)T(t)  +  Tg(t)n(t) . 

From  (18.1)  and  the  definition  of  normal  curvature  we  have 

C{t)  =  T{t))  =  kp(t) (T (t)) 

and  c(t)  is  the  normal  curvature  at  P it)  in  the  direction  Tit).  Hence,  rewriting  (18.1) 
we  obtain 


/T(0\'  /  0  Kgit)  Knit)\/Tit)\ 

I  ns  it)  J  =  j  -Kgit)  0  r git)  J  I  ns  it)  J  (18.3) 

\  n(0  J  \-Knit)  -t git)  0  /  \  nit)  J 

where  we  have  written  Knit)  in  place  of  kp^ij it)). 

By  (18.3) 

T\t)  =  Kgit)nsit)  +  Kn(t)n(t) .  (18.4) 

but  if  r,  as  a  parametrized  curve  in  M3  has  strictly  positive  curvature,  we  can  add  to 
this,  by  using  (7.E)  and  (18.4),  and  obtain 

T\t)  =  Kit) Nit)  =  Kgit)nsit)  +  Knit)nit) .  (18.5) 

To  avoid  confusion  between  the  different  types  of  curvature  we  call  k  the  intrinsic 
curvature  of  T  and  use  N  to  denote  the  normal  to  T  in  M3  whenever  it  exists.  By 
(18.5)  and  Pythagoras’  Theorem, 

K2(t)  =  K2(t)  +K2it). 

Equation  (18.5)  is  a  decomposition  of  the  intrinsic  curvature  into  its  normal  and 
tangential  components  and  establishes  a  relationship  between  the  three  different 
kinds  of  curvature.  If  we  consider  curvature  as  a  measure  of  “bending”  towards  the 
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normal  then,  since  we  have  chosen  our  normal  ns (t)  to  lie  in  the  tangent  space,  this 
and  (18.5)  suggest  that  we  consider  geodesic  curvature  as  the  surface  curvature  of  the 
parametrized  curve.  So  far  we  have  only  considered  a  unit  speed  parametrized  curve. 
We  define  the  geodesic  curvature  of  a  parametrized  curve  in  an  oriented  surface  as  the 
geodesic  curvature  of  a  unit  speed  reparametrization  of  the  curve  which  preserves  the 
original  sense  of  direction.  Note  that  our  definition  of  geodesic  curvature  is  based,  as 
was  normal  curvature,  on  curvature  in  M2 — see  the  introduction  to  Chap.  7.  Since  the 
two  unit  normals  at  any  point  on  a  surface  are  parallel,  Eq.  (18.4)  shows  that  | Kg | — 
the  absolute  geodesic  curvature — does  not  depend  on  either  the  choice  of  normal 
or  parametrization.  As  surfaces  can  be  covered  by  simple,  and  hence  orientable, 
surfaces  it  follows  that  absolute  geodesic  curvature  is  well  defined  for  any  curve  on 
any  orientable  surface. 

Curves  with  Kg  =  0  are  said  to  have  zero  geodesic  curvature.  We  give  a  geomet¬ 
rical  interpretation  of  this  phenomena  and  afterwards  discuss  a  practical  method  for 
identifying  such  curves.  If  we  are  dealing  with  a  curve  in  M2  then  zero  curvature 
implies  that  the  curve  is  a  straight  line.  If  we  identify  M2  with  the  oriented  surface 
M2x  y)  in  M3  and  consider  a  curve  r  in  M2  as  a  curve  in  M3  then  we  see  easily  that  its 

plane  curvature  in  M2  coincides  with  its  geodesic  curvature  in  M2x  y  In  Mr  we  also 
note  that  a  curve  is  a  straight  line  if  and  only  if  it  follows  the  shortest  route  between 
any  pair  of  its  points.  Now  the  tangent  plane  is  the  closest  plane  to  the  surface  near 
a  given  point  p  and  since  geodesic  curvature  is  essentially  curvature  on  the  tangent 
plane  it  is  at  least  plausible  that  zero  geodesic  curvature  implies  that  r  follows  the 
shortest  path  on  the  surface  between  points  on  r  close  to  p.  This  is  indeed  the  case. 

Formally  we  have  the  following  definitions  and  results.  A  surface  S  is  connected 
if  between  any  two  points  p  and  q  on  S  there  exists  at  least  one  path  (or  directed 
curve)  with  initial  point  p  and  final  point  q .  We  define  the  distance  between  p  and 
q,  d(p ,  q),  as 

inf  {length(y),  y  is  a  path  with  initial  point  p  and  final  point  g} . 

A  path  y  joining  p  and  q  is  a  shortest  path  if 

d(p,q)  =  length (y) . 

Shortest  paths  may  or  may  not  exist  and  if  they  exist  they  may  not  be  unique.  For 
instance  it  is  easily  seen  that  there  is  no  shortest  path  on  the  surface  S  =  {(x,  y,  0)  : 
0  <  x2  +  y2  <  2}  in  M3  joining  the  points  (—1,0,  0)  and  (1,0,0)  although  it  is  easy 
to  see  that  the  distance  between  them  is  2.  On  the  other  hand  there  exist  an  infinite 
number  of  shortest  paths  on  a  sphere  joining  the  North  and  South  poles  (any  line  of 
longitude  is  a  shortest  path).  Equation  18.5  leads  to  a  simple  practical  criterion  for 
identifying  unit  speed  curves  of  zero  geodesic  curvature  since  it  is  easily  seen,  using 
(18.5),  that  P"(t)  (or  N(t))  is  parallel  to  n(t),  for  any  choice  of  normal,  if  and  only 
if  Kg(t)  =  0.  This  motivates  the  following  definition. 
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Definition  18.1  A  parametrized  curve  P :  [a,  b]  ->  r  C  S,  where  S  is  a  surface  in 
M3,  is  a  geodesic  in  S  if  P"  ft)  (the  acceleration  of  the  parametrization)  is  parallel  to 
the  normal  to  the  surface  at  Pft)  for  all  t. 

Our  remarks  above  show  that  a  unit  speed  parametrized  curve  is  a  geodesic  if  and 
only  if  it  has  zero  geodesic  curvature. 

Proposition  18.2  A  parametrized  curve  P  :  [a,  b]  —>  r  e  S  is  a  geodesic  if  and 
only  if  it  has  constant  speed  and  zero  geodesic  curvature. 

Proof  We  may  assume,  without  loss  of  generality,  that  [a,  b]  =  [0,  a].  We  first 
suppose  that  the  parametrized  curve  is  a  geodesic.  We  have 

^(P'(t),P\t))  =2(P"  ft),P\t)). 
dt 

Since  the  curve  lies  in  S ,  P' ft)  is  a  tangent  vector  at  Pft ),  and  as  P  is  a  geodesic 
P"  ft )  is  perpendicular  to  every  tangent  vector.  This  implies  {P' ft),  P" ft))  =0  and 

—  (IIP7  (Oil2)  =  0-  Hence  ||  Pft)  ||  is  a  constant  function  of  t  and  the  parametrization 
dt 

has  constant  speed  c.  This  implies  that  the  parametrization  Q  :  [0,  ca]  — >  r  e  S , 
where  Qft)  =  Pft/c),  is  unit  speed.  Since  Q"ft)  =  P"(t/c)/c 2,  we  see  that  Q"ft) 
is  parallel  to  P"ft/c)  and  hence  to  n {Pft/c))  =  n(Q(t)). 

Conversely  suppose  P  :  [0,  a]  — >  S  has  constant  speed,  c,  and  zero  geodesic 
curvature.  Then  Q  :  [0,  ca]  — >  S  where  Q{t )  =  Pit /c )  is  unit  speed  and  has  zero 
geodesic  curvature.  By  (18.4),  Q" it)  is  parallel  to  nft/c)  and  since  P"(t/c )  and 
Q"  ft)  are  parallel  this  shows  that  the  curve  parametrized  by  P  is  a  geodesic.  This 
completes  the  proof.  □ 

From  the  above  considerations  it  is  not  difficult  to  show  that  a  parametrized  curve 
is  a  geodesic  if  and  only  if  it  satisfies  a  certain  ordinary  differential  equation.  Exis¬ 
tence  theorems  for  ordinary  differential  equations  show  that  any  surface  admits  an 
abundance  of  geodesics.  The  following  is  true. 

Proposition  18.3  If  S  is  a  surface  in  M3,  p  e  S  and  v  e  Tp(S)  is  non-zero  then 
there  exists  s  >  0  and  a  unique  geodesic  P  :  [— e,  s]  ^  S  such  that  P(0)  =  p  and 
P'i  0)  =  v. 

Example  18.4  Let  S2  denote  the  unit  sphere  with  centre  at  the  origin  in  M3.  Let  v 
and  w  denote  perpendicular  unit  vectors  in  M3  and  let 

Pft)  =  cos (at)\  +  sin(at)w,  t  e  R 

where  a  is  a  fixed  real  number.  By  Pythagoras’  Theorem  P  defines  a  parametrized 
curve  in  S2.  We  have 


P"(t)  =  —a2  cos (at)x  —  a2  sin(at)w  =  —a2P(t) . 
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Fig.  18.1 


Since  n(7)  =  ±P(t)  for  the  unit  sphere  with  centre  at  the  origin  this  implies  P"(t)  \ 
n(0  for  all  t  and  P  is  a  geodesic.  Proposition  18.3  shows  that  a  geodesic  is  completely 
determined  by  its  position  and  velocity  at  a  single  point.  Since  P  (0)  =  v  and  P'( 0)  = 
aw  it  follows  that  we  have  found  all  geodesics  on  the  unit  sphere. 

We  return  to  the  problem  we  started  with — the  existence  of  a  shortest  path.  Propo¬ 
sitions  18.2  and  18.3  show  that  there  are  many  curves  on  a  surface  with  zero  geodesic 
curvature.  From  this  it  is  possible  to  prove  the  following  result  which  shows,  at  least 
locally,  that  there  are  always  shortest  paths. 

Proposition  18.5  If  p  is  a  point  on  a  surface  S  in  M3  then  there  exists  e  >  0  such 
that  for  any  q  in  S,  d(p,  q)  <  s,  there  is  a  unique  shortest  path  in  S  joining  p  and 
q.  This  path  has  zero  geodesic  curvature  and  may  be  parametrized  as  a  geodesic. 

Geodesics  also  lead  to  a  new  derivation  of  Gaussian  curvature.  Take  a  point  p  on 
the  surface  S.  An  extension  of  Proposition  18.3  shows  that  there  exists  e  >  0  such 
that  for  every  unit  tangent  vector  v  at  p  the  geodesic  with  initial  point  p  and  initial 
velocity  v  is  defined  on  [0,  s].  If  we  consider  the  set  of  positions  taken  at  time  e  by 
all  unit  speed  geodesics  starting  at  p  we  obtain  a  curve  y£  in  S.  We  denote  the  inside 
of  this  curve  by  As  (Fig.  18.1). 

Thus  ys  consists  of  those  points  in  S  whose  distance  to  p  is  e  and  As  are  the 
points  whose  distance  to  p  is  less  than  s.  It  can  be  shown  that  ys  is  a  closed  subset 
of  S  and  As  is  open.  If  S  is  flat  then  ys  is  a  circle  of  radius  e  and  length  2ns  and 
As  is  a  disc  of  area  ns2.  Hence  the  quantities  2ns  —  l(ys)  and  ns2—A(A£)  where 
/  =  length  and  A  =  Area  are  some  measure  of  the  curvature  of  S.  In  fact,  taking 
limits  we  obtain  the  following 


Thus  both  normal  and  geodesic  curvature  lead  quite  naturally  to  Gaussian  curvature. 
It  is  also  worth  noting  that  the  two  most  important  geometrical  concepts  that  we 
associated  with  a  surface — Gaussian  curvature  and  geodesics — both  turned  out  to  be 
local  properties  independent  of  any  orientations  used  for  calculations  or  motivation. 

To  complete  our  introduction  to  the  geometry  of  surfaces  we  present  without 
proof  some  rather  remarkable  results  involving  the  concepts  we  have  introduced. 
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Fig.  18.2 


These  results  are  easily  stated  while  the  proofs  are  rather  involved.  Even  in  the 
absence  of  any  ideas  regarding  the  proofs  it  is  well  worth  thinking  about  these  results 
and  their  geometric  significance.  You  may  consider  verifying  them  on  some  of  the 
classical  surfaces  we  have  studied,  e.g.  the  sphere,  ellipsoid  or  torus.  Trying  to  prove 
them  will  take  some  time  but  if  you  make  the  effort  and  are  patient  you  will  learn  a 
lot  regardless  of  how  successful  you  are  in  completing  the  proofs. 

The  first  result  we  discuss  is  a  generalization  of  the  well-known  result  in  Euclidean 
geometry  which  says  that  the  sum  of  the  interior  angles  in  a  (plane)  triangle  is  equal 
to  7T.  This  corresponds  to  the  case  where  the  surface  is  a  plane  and  the  Gaussian 
curvature  is  zero.  A  triangle  in  a  surface  is  a  simple  closed  oriented  curve  formed  by 
three  smooth  directed  curves  (Fig.  18.2). 

Each  of  these  smooth  curves  is  called  an  edge ,  the  edges  meet  at  a  vertex  and  the 
interior  of  the  triangle  is  called  its  face.  A  triangle  is  called  a  geodesic  triangle  if 
each  edge  is  a  geodesic.  In  the  plane,  geodesics  are  straight  lines  so  the  usual  triangle 
in  the  plane  is  a  geodesic  triangle.  At  the  vertices  P,  Q  and  R  the  tangents  of  the  two 
curves  which  meet  are  in  the  same  tangent  space  on  the  surface  and  we  can  define 
the  angle  between  them.  It  is  also  possible  to  define  what  we  mean  by  an  interior 
angle  (e.g.  the  angle  a  at  the  vertex  P).  Let  A  denote  the  face  of  the  triangle. 

Theorem  18.6  (Local  Gauss-Bonnet  Theorem)  In  a  geodesic  triangle  on  a  simple 
surface 

J  J  K  =  ^  interior  angles  —  n. 

A 

Stokes’  Theorem  plays  an  important  role  in  the  proof.  For  example  consider 
the  sphere  of  radius  r.  By  Example  18.4  the  lines  of  longitude  and  the  equator  are 
geodesics.  Hence  the  triangle  formed  by  the  lines  of  longitude  corresponding  to 
f  =  0  and  =  tv / 2  and  the  equator  are  a  geodesic  triangle  (Fig.  18.3). 

The  three  interior  angles  are  all  7r/2  and  the  area  of  the  triangle  is  |  x  (total  area 
of  the  sphere)  and  hence  equal  to  47rr2/8.  Since  the  Gaussian  curvature  is  1  /r2  this 
implies 


18  Geodesic  Curvature 


223 


Fig.  18.3 


while 


471  r2 
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TC 
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interior  angles 


Tt 
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and  we  have  verified  the  local  Gauss-Bonnet  Theorem  for  this  triangle. 

Any  compact  (i.e.  closed  and  bounded)  oriented  surface  S  in  M3  can  be  partitioned 
into  a  finite  number  of  triangles  each  of  which  is  oriented  in  an  anticlockwise  direction 
about  the  normal  (Fig.  18.4a).  This  means  that  an  edge  which  is  in  two  triangles  has 
opposite  orientations  in  each  (Fig.  18.4b).  Let  V  denote  the  total  number  of  vertices, 
E  the  total  number  of  edges  and  F  the  total  number  of  faces. 

A  remarkable  result  of  Euler  says  that  no  matter  how  we  partition  the  surface  into 
(not  necessarily  geodesic)  triangles  the  quantity  V  —  E  +  F  remains  unchanged.  We 
call  this  number  the  Euler-Poincare  characteristic  of  S  and  denote  it  by  x(S).  As 
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N  1 


Fig.  18.5 


a  consequence  of  the  local  Gauss-Bonnet  Theorem  we  have  the  following  global 
result: 


Theorem  18.7  (Global  Gauss-Bonnet  Formula)  If  S  is  a  compact  oriented  surface 
then 


K  =  2ttx(S). 


s 


We  call  ff  K  the  total  curvature  of  the  surface  and  it  is  remarkable  that  this 

s 

quantity  is  always  an  integer  multiple  of  2n .  Applying  this  to  a  sphere  of  radius  r 
we  find 

1  4jtr 2 

K  =  — ^  Surface  Area  (sphere)  =  — —  =  2ttx(S) 


and  see  that  x  (sphere)  =  2. 

On  the  other  hand  we  may  partition  the  sphere  by  the  lines  of  longitude  corre¬ 
sponding  to  0,  ±7t/2,  jt  and  the  equator  (Fig.  18.5).  The  outer  edges  are  on  the  back 
of  the  sphere  and  coincide  with  180°  East  or  180°  West  (i.e.  the  international  date 
line).  By  counting  we  get  V  =  6,  E  =  12  and  F  =  8  and  again 


X  (sphere)  =  6—12  +  8  =  2. 


Triangles  can  be  replaced  by  rectangles  in  calculating  the  Euler-Poincare  charac¬ 
teristic  since  each  rectangle  can  be  partitioned  into  triangles  using  diagonals.  This 
doubles  the  number  of  faces  F  and  adds  F  new  edges.  Overall  the  sum  V  —  E  +  F 
is  unchanged.  On  a  box  (Fig.  3.2)  we  have  V  =  8,£  =  12,E  =  6  and  hence 
V  —  E  +  F  =  2.  Since  a  box  can  be  inflated  into  a  sphere  this  shows  once  more  that 
the  Euler-Poincare  characteristic  of  the  sphere  equals  2. 
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(no  holes) 


usual  torus 
(one  hole) 


Fig.  18.6 


Any  compact  oriented  surface  can  be  smoothly  deformed  into  a  surface  with  a 
finite  number  of  holes  (Fig.  18.6).  We  call  the  number  of  holes  the  genus  of  the 
surface.  This  smooth  deformation  does  not  change  the  number  of  faces,  edges  or 
vertices  of  any  partition  and  hence  the  Euler-Poincare  characteristic  is  unchanged. 
It  does,  however,  change  the  Gaussian  curvature  in  many  places,  e.g.  the  sphere 
can  be  changed  into  an  ellipsoid  and  we  know  that  these  do  not  have  the  same 
Gaussian  curvature.  The  global  Gauss-Bonnet  Theorem  says  that  the  total  curvature 
is  unchanged. 

For  any  compact  surface  S  we  have 

X(S)  =  2-2g 

where  g  is  the  genus.  This  implies  that  the  Euler-Poincare  characteristic  is  always 
an  even  integer  and  the  total  curvature  is  an  integer  multiple  of  47r .  Thus  we  have 
a  remarkable  set  of  relationships  between  total  curvature,  the  number  V  —  E  +  F 
and  the  number  of  holes  on  a  compact  oriented  surface  S.  This  is  not  the  end  of  the 
story  as  they  are  all  equal  to  the  index  of  any  smooth  vector  field  on  S.  The  index  of 
a  vector  field  X,  i  (X),  is  obtained  by  assigning  an  integer  to  each  zero  following  a 
prescribed  formula.  Since  x  (sphere)  =  2  it  follows  that  every  smooth  tangent  vector 
field  on  a  sphere  has  at  least  one  zero  and  explains  the  existence  of  the  bald  spot 
which  most  people  usually  have  at  the  point  of  maximal  curvature  on  the  midline  of 
the  calva. 

It  is  interesting  to  speculate  in  a  purely  geometric  way  why  these  things  are  the 
way  they  are.  For  instance,  we  have  seen  that  the  total  curvature  of  a  sphere  does 
not  depend  on  the  radius.  Think  of  blowing  up  a  balloon.  The  bigger  the  radius  the 
larger  the  surface  area.  On  the  other  hand  the  sphere  is  becoming  less  curved,  i.e.  the 
Gaussian  curvature  is  decreasing,  and  over  the  whole  surface  the  increase  in  surface 
area  is  counterbalanced  by  the  decrease  in  curvature.  Another  simple  observation: 
from  the  formula  x  (S)  =  2  —  2g  we  see  that  adding  holes  decreases  the  Euler- 
Poincare  characteristic  and  hence  adds  negative  Gaussian  curvature  to  the  surface. 
Why  should  this  be  so?  If  we  recall  our  study  of  the  Gaussian  curvature  of  the  torus 
(Example  17.1)  we  noted  that  at  the  points  P  and  Q  in  Fig.  18.6  we  had  negative 
Gaussian  curvature  while  at  R  and  S  we  had  positive  Gaussian  curvature.  Adding 
holes  creates  points  like  P'  and  Q'  while  the  outside,  where  the  Gaussian  curvature 
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is  positive,  is  relatively  unchanged  and  it  is  at  least  plausible  that  we  are  increasing 
the  overall  negative  Gaussian  curvature  by  adding  holes. 


Exercises 

18.1  Show  that  every  geodesic  on  the  cylinder  {(x,  y,  z)  :  x2  +  y2  =  1)  has  the 
form 

<p(t)  =  (cos  (at  +  b),  sin  (at  +  b),  ct),  tel. 

18.2  Let  (pit)  =  (x(t),y(t))  ,a<t<b  denote  a  unit  speed  parametrized  curve  in 
Mr  and  suppose  y(t)  >  0.  Let  S  denote  the  surface  obtained  by  rotating  this 
curve  about  the  v-axis.  If 

P(t,d)  =  [x{t),  y(t)  cost!,  y(t)  sind) 
show  that  the  mapping 

<P  :  t - >  P(t,  Go) 

is  unit  speed  and  a  geodesic  for  every  fixed  Go.  Find  the  normal  curvature  in 
the  direction  <p'(t)  at  the  point  P(t,  Go). 

18.3  Prove  that  a  straight  line  which  lies  in  a  surface  is  a  geodesic.  By  using  this 
result  find  for  each  point  P  on  the  surface  z  =  x2  —  y2  two  geodesics  passing 
through  P . 

18.4  Let  P  :  [a,  b]  — >  r  denote  a  unit  speed  parametrized  curve  on  the  surface  S. 
Suppose  r  has  positive  curvature  in  M3 .  Show  that  r  is  a  fine  of  curvature  if 
and  only  if  the  geodesic  torsion  xg  =  0.  If  r  is  a  geodesic  show  that  xg  =  r 
where  r  is  the  torsion  of  r  as  a  curve  in  M3  (Chap.  7). 

18.5  If  r  is  a  directed  curve  in  a  sphere  with  zero  torsion  show  that  r  is  part  of  a 
circle. 

18.6  Let  P  :  [a,  b]  — >►  r  denote  a  unit  speed  curve  with  strictly  positive  intrinsic 
curvature  in  an  oriented  surface  S.  Show  that  the  normal  curvature  at  P(t) 
in  the  direction  P'(t)  is  zero  if  and  only  if  the  osculating  plane  to  the  curve 
coincides  with  the  tangent  plane  to  the  surface.  (A  curve  with  this  property  at 
all  points  is  called  an  asymptotic  curve  on  the  surface.) 

18.7  Show  that  a  curve  in  a  sphere  with  constant  geodesic  curvature  is  part  of  a 
circle. 

18.8  Find  the  Euler-Poincare  characteristic  of  the  torus  by  partitioning  it  into  tri¬ 
angles  and  calculating  V  —  E  +  F.  Verify  your  result  by  calculating  the  total 
curvature  (see  Example  17.1). 

18.9  If  T\  and  T2  are  two  triangulations  of  a  compact  oriented  surface  S  then  T\  C  T2 
(or  T2  is  a  refinement  of  T\ )  if  every  triangle  in  T\  is  a  union  of  triangles  from  T2 . 
Let  Vi ,  Ei  and  F)  denote,  respectively,  the  number  of  vertices,  edges  and  faces 
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in  Ti ,  i  =  1,2.  Show,  by  induction  on  V2  —  V\  that  V\  —  E\ + F\  =  V2  —  E2 + F2 . 
Hence  show  that  the  Euler-Poincare  characteristic  is  well  defined-  that  is  that 
V  —  E  +  F  has  the  same  value  for  any  triangulation  of  S. 

18.10  Let  Ti  denote  a  triangulation  of  the  compact  oriented  surface  S /  of  genus 
gi ,  i  =  1,2.  Let  ti  denote  a  fixed  triangle  in  7} ,  /  =  1,2  and  let  S3  denote  the 
surface  obtained  by  taking  the  union  of  Si  and  S2,  identifying  the  edges  of  t\ 
and  t2  (preserving  orientation)  and  removing  the  face  t\  —  t2.  Show  that  S3 
is  a  compact  oriented  surface  of  genus  g\  +  g2-  Use  this  result,  x  (Torus)  =  0 
and  induction  to  show 

X  (S)  =  2  —  2g 


for  any  oriented  surface  S. 


Solutions 


Solutions,  answers,  hints  or  relevant  remarks  to  selected  exercises  are  provided. 
Unexplained  notation  can  be  found  in  the  text. 


Chapter  1 


1 . 1  The  only  non-empty  open  and  closed  subset  of  M/7  is  W2 .  The  union  of  open 
sets  is  open  and  the  intersection  of  closed  sets  is  closed. 


(a)  Interior  of  a  solid  ellipse,  open  and  bounded,  not  compact. 

(b)  Surface  of  a  cone,  closed,  not  bounded  since  ( n ,  0,  n)  lies  in  the  surface  for 
all  n ,  not  compact. 

(c)  First  octant  in  M3 — like  the  first  quadrant  in  M2.  Closed,  not  bounded  since 
(n,  0,0)  lies  in  the  set  for  all  n.  Hence  not  compact. 

(d)  x2  +  y2  +  (z  —  l)2  =  lis  the  surface  of  a  sphere  with  centre  (0,  0,  1)  and 
radius  1 .  Closed,  bounded  and  hence  compact. 

(e)  Intersection  of  sphere  of  radius  2  and  centre  (0,  0,  2)  and  cylinder  parallel 
to  z- axis  based  on  circle  in  (x,  y)-plane  with  centre  (0,  0)  and  radius  2. 
Intersection  is  circle  in  the  plane  z  —  2  of  radius  2  with  centre  (0,  0,  2), 
compact. 


1.2  (a) 


§£=2,loB<l+,y)  +  2£0£. 

x2y2). 


d£  2(V±V)Vy 
dy  1  +x2y2 


%  -  2-log(l  + 


dg  _ 


%  =Jtan-1(xz)+T^? 


%  =  x  tan 


-l 


(xz), 


dg  _  x2y 

dz  1  +x2z2  ’ 
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Solutions 


1.3  F'(x,  y,  z.  w)  = 


2x 

2y 

z 


V  2xz2w2 


DyF{  1,2, -1,-2)  = 


-1 

V  8 


— 2y 

0 

0  \ 

\ 

2x 

0 

0 

0 

V 

0 

5 

0 

2zw2v2 

2  z 

2wx2  / 

-4 

0  0  \  /  2  \ 

/  0  \ 

2 

0  0 

1 

10 

0 

1  0 

-2 

-4 

0 

-8  — 4  j 

\-l) 

(  36  ) 

1.4  V/O,  y,  z)  =  (2 x-y,  -*+z3,  3yz2-6).  V/(v,  y,  z)  =  (0,  0,  0)  &  2 x-y  = 
0,  —  v  +  z3  =  0,  3 yz2-6  =  0  o  y  =  2x,  z3  =  x,  yz2  =  2  o  y  =  2x,  z3  = 
x,  xz2  =  1  <=>  y  =  2x,  z3  =  x,  z5  =  1  4^  z  =  l,  x  =  1,  y  =  2.  Solution 


(1,2,1). 

1.5  V/  =  (2xey,  x2ey,  0),  Vg  =  (zy2£xz,  2yexz,  xy2exz),  fg  =  x2y2ey+xz. 
V(/g)  =  (2x-y2^}7+xz +x2j2z^}7+xz,  2v2y£};+xz  +x2j2^:v+xz,  x3y2^:v+xz)  = 
x2ey  (zy2exz,  2yexz,  xy2exz)  +  y2exz( 2xey,  x2ey,  0)  =  fVg  +  gVf. 

1.6  If  P(t)  =  ( x\(t ),  X2 (t), . . . ,  xn(t))  then  \\P(t)\\2  =  x2(t)  +  •  •  •  +  x2{t)  and 

|(I|P(0II2)  =  2xl(t)x[(t)  +  - •  .+2 xn(t)x'n(t)  =  2I?=1  ^-(0^(0  =  2P{t)o 
P'(t )  =  2{P(t),  P'  it))  where  we  supposed  for  convenience  that  P(f)  and  P'it) 
are  in  the  same  space.  If  HP  (Oil  does  not  depend  on  t  then  ^(||P(0ll2)  =  0 
and  (Pit),  P' it))  =  0.  The  inner  product  of  two  vectors  is  zero  if  and  only  if 
they  are  perpendicular.  In  this  exercise,  which  is  extensively  used  in  Chaps.  5-8 
and  16-18,  we  used  the  inner  product  notation.  If  we  use  matrix  notation  then 
Pit)  is  a  1  x  n  matrix,  P'(t)  is  an  n  x  1  matrix  and  P(t)  °  P'it)  is  a  scalar. 


1.7  F\X)  = 


0 

2y 

vz 


G\X)  = 


H(x,  y,  z)  =  x2ex  +  y4  -  z4  +  v2y2z2, 

XH(x,  y,  z)  =  (2xex  +  x2ex  +  2vy2z2,  4y3  +  2v2yz2,  — 4z3  +  2 x2y2z). 
Note  F :  M3  —>  M3  hence  Ff  is  a  3  x  3  matrix,  G :  M3  — >  M3  hence  Gr  is  a 
3x3  matrix  and  (F,  G) :  M3  — >  M1  hence  V((F,  G))  is  a  1  x  3  matrix.  This 
means  G  o  F'  +  F  o  Gr  is  a  1  x3  matrix  and  OoG  +  FoG/  would  not  give  the 
correct  answer  since  it  is  not  possible  to  add  a  1  x  3  matrix  and  a  3  x  1  matrix. 

1.8  m  (C*2  +  y2  +  z2r1/2)  = -\{x2  +  y2  +  z2)-3'2  ■  2x  = 

1.9  Let  F  =  (/i, . . . ,  /,„).  Then  ||F||  =  (/t2  +  •  ■ 


— V 


(x2  +  y2  +  z2)3/2 
+  fl)x'2  and  £(\\F\\)  = 


1 


•2 


1 


i(/lZ  +  /2Z  +  •••  +  fm)  1/2  •  -  ||F||  ^j=UJdx 

d  ni  Z7ii\  1  srn  /  T7  dF 


(F,^)/\\F\\,  Vv(||F||)  =  X7=1  Vi 


ZT=1  fj 


9f 


J  


117  “  ||F||  Si  =  l  — 

jL(F,  viw-)  =  ~  n .  We  require  F(P)  ^  0  since  we  divide  by 


F||  ^  ’  ^  =  i  “  ||F|, 

F||.  The  result  is  not  true  otherwise,  e.g.  fix)  —  \x\,  x  e  R,  is  not  dif¬ 
ferentiable  at  the  origin.  Note  the  use  of  the  notation  Vv  in  place  of  Dy  for 
scalar- valued  functions. 
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1.10  Let  F(x i,  . . . ,  xn)  =  (x\,  . . . ,  xn)  in  Exercise  1.9.  Then  DeiF  =  e*  and 

A, (1/11*11)  =  -p^A;(ll*ll)  =  -^De.dlFH)  = 

=  -fj$  =  -Xi/\\X\\\  Hence  V(l/||*||)  =  -*/||*||3. 

1.11  H(x,y,z)  =  (( xyz)2  +  {x2  +  y2)2 ,  (xyz)2  -  {x2  +  y2)2 ,  {x2  -  y2)2  -  z\ 
(x2  -  y2)2  +  zA)  and  Hi(x,  y,  z)  =  (xyz)2  -  (x2  +  y2)2.  Hence  ^  = 
2  xy2z2  —  4x(x2  +  y2). 


/  2  u 

2v 

0 

0  \ 

/yz 

xy\ 

{  ~  ~  ~\ 

2  u 

—2v 

0 

0 

2x 

2  y 

0 

dH2 

dx 

0 

0 

2w 

-It 

2x 

-2  y 

0 

V  0 

0 

2w 

2 1  ) 

\  0 

0 

2  zj 

\  -  -  -) 

yyy  =  2uyz  —  2v'2x  =  2xyz  •  yz  —  2(x2  +  y2) 2x  =  2 xy2z2  —  4x(x2  +  y2). 

1.12  h  is  the  composition 

(xi,  ...,*„)-►  (eXl,  .  ..,ex") 

(yu  •  •  • ,  yn)  f(yu  .  .  • ,  yn) 


dh 


=  e 


Xi 


df  d2h 


dx„ 

2  d2  f 


=  e 


Xi 


df 


y 


dyt  ’  dxi 2 

/2  •  Hence  YH=\  ^2 


+  e 


Xi 


d  (df  \  _  xi  df  ,  „2xi  d2f  _  _  df 
~\Tuy)  —  e - -  — 


+  e^ 


dyi  '  dyiydyiJ  dyt  '  dyt  2 

d2h  _  -yn  df_  ,  -yn  2  <Ff_  _  Q 

—  Zw  =  1  -Vi  «9V.  +  2^  =  1  )/  «V;2  —  u- 


yi  dyi 


+ 


dyt2  •  - - ^*=1  <9*;2  _  ^*  =  1  1  ^*  =  1  ■'i  dy; 

1.14  P  =  (1,1,1),  AX  =  (0.1,0.05,-0.05),  f(P)  =  3,  /(P  +  AX)  = 
3.42628125,  V/(P)  =  (3,5,3),  /(P)  +  V/(P)  •  AX  =  3.40,  Errors 
0.02628125,  Error x  100// (P)  =  0.87604166  %. 


1.15  P_1(C)  is  the  intersection  of  the  cone  z2  —  x2  —  y2  =  1  and  the  plane 
2x  —  y  =  2.  Solving  these  equations  yields  y  =  2x  —  2  and  z2  =  1  +  v2  + 
4(x  —  l)2  =  1  +  5v2  —  8v  +  4  =  (9/5)  +  5(v  —  4/5)2.  This  shows  that  the 
level  set  is  a  hyperbola. 

1.19  The  level  set  can  be  rewritten  as  y6  +  (x  —  y)2  +  (xy  —  4 z)2  =  51,  hence 
y6  <  64  and  |y|  <  2,  (x  —  y)2  <  64  implies  \x\  <  \y\  +  8  <  10.  Hence 
\xy\  <  20  and  (xy  —  4 z)2  <  64  implies  \4z\  <  \xy  \  +  8  <  28.  Hence  \z\  <7. 
This  shows  that  the  level  set  is  bounded.  It  is  also  closed. 

1.21  AX  =  AX,  AY  =  fiY,  A  ^  fi  =>  A(X,  Y)  =  (AX,  Y)  =  ( AX,Y )  = 
(X,  AY)  =  (X,  fiY)  =  /i(X,  Y)  =>►  (A  —  /i)(X,  7)  =  0  =>  X  _L  F  since 
A7 -  fi. 
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2.1  F[(X)  =  VF\(X)  =  (2x\,  —2x2,  0,  0)-  F\  has  full  rank  except  when  x\  = 
X2  =  0.  x\  7^  0  =>  {x2,  x3,  X4}  can  be  taken  as  the  independent  variables, 
X2  7^  0  =>►  {x\,  X3,  x 4}  can  be  taken  as  the  independent  variables.  F\  (P\)  = 
—3.  If  x\  7^  0  let  (x2,  x3,  X4)  =  (x|  —  3)1/2  near  (x2,  x3,  X4)  =  (2,  0,  —1) 


(  2xi 

-2X2 

0 

0  1 

V  0 

0 

2x3 

—  2x4  / 

5 


full  rank  x\  or  X2  7^  0  and  13  or  14  7^  0.  Pairs  of  independent  vari¬ 
ables  (xi,X3),  (xi,X4),  (x2,X3),  (x2,X4).  F2(P2)  =  (1,3),  02(x2,X3)  = 
((1  +  xf)1/2,  -(xf  -  3)1/2)  near  (x2,x3)  =  (0,2),  F^(X)  = 

2xx  -2x2  0  0  \ 

0  0  2jc3  —2x4  I ,  full  rank  any  three  of  the  variables  xi ,  x2 , 

—2xi  0  0  2x4  / 

x3,  X4  are  non-zero.  F3(P3)  =  (—3,  —7,  15),  03(xi)  = 

(7^,  Jx2  +  8,  -Jx2  +  15^  near  xi  =  1.  Other  solutions  also  exist. 

2.2  (i)  x2  +  y2  =  u2  cos2  v  +  u2  sin2  v  =  u2  =>  u  =  zb J x2  +  y2  =>►  = 

n  =  tan_1(y/x) 


zbx/v^2  +  y2,  y/x  =  usinv/ucosv  =  tan  n 


=  -j/U2  +  /). 


dx 

du  dv  du  dv 

(ii)  —  cos  v  —  u  sin  n—  =  1,  —  sin  v  +  u  cos  v—  =  0.  Solving  these  two 


dx 


dx 


dx 


dx 


OOO  r\ 

linear  equations  for  ^  and  ^  gives  us  ^  =  cos  v  and  ^  =  —  sin  v/u 
which  agree  with  (i). 

/  2xix|  2x2x2  0  0  \  /  8  4  0  0 

2.3  F'(X)  =  l  x2x3  xix3  xix2  0  J ,  F'(l,  2,  3, 4)  =  f  6  3  2  0 

\  0  0  0  2xi  /  \0  0  0  8 

hence  8x1  +  4x2  =  0,  6x1  +  3x2  +  2x3  =  0,  8x4  =  0.  Solution  set 
{(xi,  —2xi,  0,  0)  :  xi  e  R},  Basis  =  {(1,  —2,  0,  0)},  Tangent  line  =  {(1  + 
t,  1  -  2t,  3,  4)  :  t  €  R}. 

2.4  (a)  Let  /(x,  y,  z)  =  xey  —  z.  Surface  =  /-1( 0),  V/(x,  y,  z)  =  (ey,xey,  — 1), 
V/(l,  0,  1)  =  (1,  1,  —1),  Normal  line  =  {(1  + 1,  t,  1  —  t)  :  t  e  R},  Tangent 
plane  =  {(x,  y,  z)  :  (x  -  1)  •  1  +  y  •  1  +  (z  -  1)  •  (-1)  =  0}  =  {(x,  y,  z)  : 
x  +  y  -  z  =  0}. 

(b)  Let  F(x,y,  z)  =  (x2  +  y2  —  z2,  x  +  y  +  z).  T  is  the  set  F-1(l,5). 
F'(x.y. z)  =  ^  -f).  F'<1,2.2)  =  (J  <  7) 


Tangent  line  =  (1,  2,  2)  +  {(x,  y,  z)  :  2x  +  4y  —  4z  =  0,  x  +  y  +  z  =  0}  = 
(1,  2,  2)  +  {(x,  y,  z)  :  y  =  3z,  x  =  -4z}  =  {(1  -  At,  2  +  3t,  2  +  t)  :  t  e  R}. 
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2.5  Equation  of  plane  is  ax  -\-by  -\-  cz  =  d.  Using  (1,  2,  3)  and  (4,  5,  6)  we  obtain 
a  +  2b  +  3c  =  d  and  4a  +  5  b  +  6c  =  d.  Since  the  plane  is  perpendicular 
to  the  plane  lx  +  8y  +  9z  =  10  it  follows  that  (a,  b,  c)  •  (7,  8,  9)  =  0,  i.e. 
la  +  8/?  +  9c  =  0.  Solving  for  a,  b ,  c  and  d  gives  b  =  — 2a ,  c  =  a  and  d  =  0. 
Solution  is  {(v,  y,  z)  :  v  —  2y  +  z  =  0}. 

2.6  Let  f(x,  y,z )  =  V*  +  Va  +  v/d,  4>  1)  =  d/2>  l/4>  l/2)-  Tangent 
plane  to  f~\4)  at  (1,4,  1)  is  {(*,  y,  z)  :  (*  -  1)  ■  (1/2)  +  (y  -  4)  •  (1/4)  + 
(z  ~  1)  •  (1/2)  =  0}  =  {(v,  y,  z)  :  2*  +  y  +  2z  =  8}. 

2.7  Let  f(x,y,z )  =  x2  +  4 y2  +  4z2,  Vf(x,y,z )  =  (2v,  8y,  8z).  Tangent  planes 
to  /_1(1)  at  (l/\/2,  1/4,  1/4)  and  (V3/2,  0,  1  /4)  are  V2v  +  2 y  +  2z  =  2  and 
\/3v  +  2  z  =  2.  Line  of  intersection  =  {((,  (V3  —  y/2)t/2,  l—ty/3/2)  :  £  e  R}. 
k  =  distance  squared  of  line  to  the  origin  =  (9  —  2\/6)/(12  —  2\/6)  =  (14  — 
v/6)/20. 

2.8  Substitute  x2  =  1+y2  intov2+2y2  =  4  to  get  1+3 y2  =  4.  Hence  3 y2  =  3  and 
y  =  ±1,  x  =  ±\/2.  Lour  points  are  (±V2,  ±1),  (a,  /?)  =  (\/2,  1).  Tangent 
line  to  hyperbola  at  (\/2,  1),  \flx  —  y  —  1  has  slope  \/2.  Normal  line  to  ellipse 
points  in  direction  (2\/2,  4)  and  hence  has  slope  4/2^2  =  \J2.  Both  lines 
pass  through  (\/2,  1)  and  hence  coincide.  Tangent  lines  meet  at  (0,  ±1)  and  at 
(+1/V2,  0).  Area=  \/2. 

2.9  Direction  of  normal  line  to  paraboloid  at  (1,  1,4)  is  (2,  2,  —1).  Tangent  plane 

at  (1,  1,  4)  is  {(v,  y,z)  :  2x  +  2y  —  z  =  0}.  Normal  line  through  (1,  1, 4)  is 
{(1  +2 1,  1  +2 1,  4  —  t):te  R},  t  =  —9/8  gives  the  point  (—5/4,  —5/4,  41/8) 
on  paraboloid  and  on  normal.  Normal  line  through  (—5 /4,  —5 /4,  41/8)  has  di¬ 
rection  (—5/2  —5/2  —  1 )  cos#  =  (^~5/2,~5/2’~1^42’2,~1^  —  _ =2 _  — 

recuon  t  o/z,  O/Z,  r;,  cost/  (25/4+25/4+l)1/2(4+4+i)l/2  (27.9/2) i/2 


2.10  Let  f(x,y,z )  =  log(v2  +  y2)  —  2z,  S  =  /-1(0)  =  Graph(g),  where 
g(x,y)  =  \  \og(x2  +  y2),  V/(  1,  -1,  \\og2)  =  (1,  -1,  -2).  Tangent  plane 

=  {(v,  y,z)  :  x  —  y  —  2z  =  2  — log  2},  Normal  line  =  {(1  +t,  —  1  —  t,  \  log  2  — 
t)  :  t  e  R}. 

2.11  At  points  of  contact  normals  coincide.  Hence  ( f\x ),  —1,  0)  =  A(2z,  1,  2z  + 
2x)  =+  A  =  —  1  =+  z  =  —v  and  f'(x)  =  (—\)(—2x)  =/  f(x)  =  x2. 
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3.1  Let  g(x,  y,  z)  =  xy  +  yz,  fi(x,  y,  z)  =  x2  +  y2  -  1,  f2(x,  y,z)  -  x  -  yz. 
Vg  =  A1V/1+A2V/2  =>•  (y,x  +  z,y)  =  Ai(2x,  2y,  0)  +  A2O,  —z,  —y), 
y  =  — A2 y  =>  y  =  0  or  A2  =  —  1 ,  y  =  0  v  =  0  which  contradicts 
x2  +  y2  =  1.  A2  =  -1  =>  x  +  z  =  X\2y  +  z  =>  x  =  X\2y  =  yz  =>  z  =  2X\, 
y  =  2X\x  -  l  =  zx  -  l  =  (. x2/y )  -l=^y2=v2-y  =  l-y2-y=> 
2 y2  +  y  —  1  =  0.  Hence  v  =  —  1  or  v  =  1/2,  y  =  —1  =>  v  =  0  and  z  =  0. 
Solution  (0,  —1,  0)  and  g(0,  —1,0)  =  0.  If  y  =  1/2,  x  =  ±V 3/2,  z  =  ±V 3. 
Maximum  =  3\/3/4,  minimum  =  —  3\/3/4. 

3.2  Maximize  z  on  {v2  +  y2  =  1}  Ll  {v  +  y  +  z  =  1},  (0,  0,  1)  =  X\(2x,  2 y,  0)  + 
A2 ( 1 ,  1,  1)  =>  X2  =  1,  2xX\  =  — 1  =  2yX\  x  =  y  =  =bl/V2,  z  = 
1  =p  \/2.  Maximum  1  +  \[2,  minimum  1  —  \J~2. 

3.3  Maximize  ab  +  (1  /2)bd  subject  to  b  +  2a  +  2c  =  P,  b2 / 4  +  d2  =  c 2 .  ( b ,  a  + 
(l/2)d,  0,  (1/2 )b)  =  Ai(2,  1,  2,  0)  +  A2(0,  b/2,  -2c,  2d)  =>  c  =  2d,  b  = 
\pV2d,  a  =  (\/^  +  1  )d  =>  P  =  (4\/3  +  6)J.  Maximum  =  (2  —  V3)^2/4. 

3.4  (2x,  2 y,  2 z)  =  A 1  ( 1 ,  1,  —1)  +  A2(l,  3,  1)  =>-  2x  =  Ai  +  A2,  2 y  =  Ai  + 
3A2,  2z  =  —  Ai  +  A2  =>-  2v  —  y  +  z  =  0.  Together  with  constraints  this  gives 
solution  (0,  1/2,  1/2).  Minimum  1/4.  The  two  constraints  are  planes  which 
intersect  in  a  line.  Question  asks  to  find  minimum  and  maximum  distance 
squared  from  line  to  the  origin.  Line  contains  points  which  tend  to  infinity  so 
no  maximum. 

3.5  The  two  constraints  are  planes  which  intersect  in  a  line.  This  line  contains 
points  which  tend  to  infinity.  /  consists  of  positive  terms  added  together  and 
tends  to  infinity  as  either  v  or  y  or  z  goes  to  infinity.  Since  /  is  always  positive  it 
must  have  an  absolute  minimum.  (2x,2y,4z)  =  Ai (1,  1,  1)  +  A2(l,  — 1,  3)  =>► 
...=>  x  =  17/14,  y  =  16/14,  z  =  9/14.  Minimum  =  707/196.  z  = 
3  -  x  -  y  =  4>(x,  y),  f(x,  y,  <f>(x,  y))  =  x2  +  y2  +  2(3  -  x  -  y)2  and  con- 
straint  becomes  2x  +  4y  =  7. 


Mx)  =  (7  -  2x)/4,  02 W  =  (1/4) (5  -  2x),  f(x,  Mx),  <hW)  =  x2  + 
(7-2x)2/16  +  (5  -2jc)2/8. 

Ill  A  2x  2y  2z  ,  1  2Xx  2Xx  2Ay 

3.6  (-,  -,  -)  =  A(— ,  vV,  ~w)  =>  A  /  0,  -  =  — —  1  = 


a  b  e'  v a2  b2  c2  a  a* 

2X  z  x  y  z  a  b  c 

- =^-  =  -  =  -=^v  =  ±— -,  y  =  =t  — ,  z  =  ±  — , 

c  a  b  c  V3  V3  V3 

.2  i  /,,2  i  / „2\  _ y  \  _l  a  -.2 


a 


b 


3.7  (yz,xz,xy)  =  X(-l/xz,  -1  /yz,  -1  /zz)  =>  X  /  0,  vzyz  =  -A  =  vy2z  = 
xyz2  x  =  y  =  z  and  3/x  =  1  =^>x  =  y  =  z  =  3.  Minimum  27. 

3.8  V(x,  y,  z)  =  xyz ,  (yz,  xz,  xy)  =  X(l/a,  l/b,  l/c)  =>►...=>►  x/a  =  y/b  = 
z/c.  Maximum  =  abc/21 . 

3.9  Join  the  vertices  to  the  centre  and  let  v ,  y ,  z  be  the  angles  at  the  centre.  Products 
of  lengths  of  sides  =  8  R 3  sin  |  sin  j  sin  |  =  f(x,y,z).  Sum  of  squares  of 
lengths  =  47?2(sin2  |  +  sin2  |  +  sin2  |)  =  g(x,  y,  z).  R  is  the  radius  of  the 
circle.  h(x,  y,z)=x  +  y  +  Z~  2i r. 
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(a)  V/  =  A Vh  . . .  =>  cos(v/2)  =  cos(y/2)  =  cos(z/2)  =>  x  =  y  =  z- 

(b)  Vg  =  A Vh  . . .  =>  sin(x/2)  =  sin(y/2)  =  sin(z/2)  =>-  x  =  y  =  z. 

3.10  Since  x 2  >  0,  3x2  —  y5  =  0=^y5>0=^y>0.  Hence  f(x,  y)  =  2y  >  0 
and  /( 0,0)  =  0  implies  /  has  minimum  value  0  at  (0,  0).  The  method  of 
Lagrange  multipliers  does  not  work  since  the  surface  3x2  —  y5  =  0  does  not 
have  full  rank  at  (0,  0). 

3.11  Minimize  g(v)  =  (Vf(P),v)  over  (v,  v)  =  h(v )  =  1.  Vg(v)  =  A Vh(v)  =>► 
Vf(P)  =  Av  and  ||v||  =  1  =>  A  =  ±||V/(P)||/2.  Hence  g(v)  =  ...  = 
zb  ||  V/ (P)  || .  Maximum  increase  in  direction  V/ (P)/\\  V/ (P)  || . 

3.12  Let  yt  =  Xi/i.  Maximize  n\y\  •  •  •  yn  on  y\  —  1-  Use  the  method  in 

Example  3.2  to  get  y\  =  =  . . .  =  yn  =  1  / \/n  at  maximum. 

3.13  If  x,  y  and  z  are  perpendicular  distances  to  the  sides,  of  length  a ,  b  and  c,  then 
it  is  necessary  to  minimize  d ,  where  J2  =  x2  +  y2  +  z2,  subject  to  constraint 
Area  =  A  =  (1/2  )(ax  +  by  +  cz).  Minimum  =  A  A2  /(a2  -\-b2  +  c2). 

3.14  Nearest  point  on  line,  (11,2,  —4).  Distance  =  y/6. 

3.15  Since  —1  <  cos^  <  +1,  the  result  in  Exercise  2.12  is  equivalent  to  the 
Cauchy-Schwarz  inequality.  We  get  the  equality  case  by  considering  when 
cos  9  =  ±1. 

3.16  The  constraints  form  a  compact  subset  of  M4  so  the  function  has  a  maxi¬ 
mum  and  a  minimum.  If  (x,  y,u,  v)  =  (a,  0,  b,  0)  then  (xv  —  yu)2  =  0 
and  the  minimum  is  0.  Let  c  =  xv  —  yu.  It  suffices  to  consider  c  ^  0.  If 
(x,  y,  u,  v)  =  (a,0,0,b)  then  (xv  —  yu)2  =  a2b2  and  the  maximum  is  posi¬ 
tive.  W((xv— yu)2)  =  2 c(v,  —u,  —y,  x)  =  A\(2x,  2 y,  0,  0)  +  A2(0,  0,  2 u,  2v), 
cv  =  X\x,  c(—u)  =  Aiy,  c(—y)  =  X2 u,  cx  =  X2V.  c2v 2  +  c2u 2  =  X2x2  + 
A2y2  =>►  c2b2  =  X2a2,c 2  =  cvx—cuy  —  c(vx  —  uy)  =  Ai(v2+y2)  =  Ai^2. 
Hence  X\  =  b2  and  c 2  =  a2b 2  =  Maximum. 

Alternatively,  let  v  =  acosO,  y  =  asinO,  u  =  bcoscj),v  =  b  sincj)  then 
(xv  —  yu)2  =  a2b2  sin 2 (6  —  <j))  and  as  sin2(0  —  <j))  ranges  between  0  and  1  we 
get  the  above  result.  Also  this  exercise  is  a  special  case  of  the  Cauchy-Schwarz 
inequality. 
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4.1  (a)  V/(v,  y)  =  ( lx  +  y  +  2,  v  +  2)  =  (0,  0)  =>►  (—2,  2)  is  the  only  critical 
point.  =  C  qV  det(ff/(_2>2))  =  -1  <  0  =>  /  has  a  saddle 

point  at  (—2,  2). 

(b)  local  minimum  at  (1,  1),  saddle  point  at  (0,  0).  To  show  the  local  mini¬ 
mum  is  an  absolute  minimum  over  {(v,  y);  v  >  1/2,  y  >  1  /2}  note  that 
det (Hf(Xty))  =  3 6xy  —  9  >  0  when  x  >  1/2  and  y  >  1/2. 

(c)  V/O,  y,  z)  =  (3x2z  -  192,  2y  -  z,  x3  ~  y). 


f  (x,  y,  z)  —  Hf(x,y,z) 


Critical  point  (2,  8,  16). 


(2,8,16) 


/ 192  0 

(  0  2 
V  12  -1 


12 

-1 

0 


saddle  point. 

(d)  saddle  points  at  (2,  4),  (—1,  4),  (2,  1),  local  maximum  at  (1,3)  (the  change 
of  variables  u  =  2  —  x,  v  =  4  —  y  simplifies  the  calculations). 

(e)  local  maxima  at  (±1,  ±1,  1),  (±1,  1,  ±1),  (1,  ±1,  ±1). 

(f)  critical  points  at  (±  1  / >/2,  ±  1  /  \[l,  ±  1  />/2),  local  maxima  if  even  number 
of  negative  signs  otherwise  local  minima.  Using  one  variable  calculus, 
max  of  over  R  is  1  / and  min  is  —  1/V2C1/2.  Hence  max  of 
xyze~x2~y2~z 2  over  M3  is  X/l+Jle2!2  and  min  is  —  \/2\f2e2^2 . 

(g)  saddle  point  at  (1,  1,  1/2). 

(h)  saddle  point  at  (2\/2,  2,  —2). 

(i)  saddle  point  at  (  —  1 ,  1/2,  1  /2) . 

(j)  saddle  point  at  (1,  2,  1). 

4.2  We  always  have  f(x,y,z)  >  0.  Since  /(0,  0,  0)  =  0,  /  has  an  absolute 
minimum  at  (0,  0,  0).  Critical  points  satisfy  lax  =  2v(av2+Z?y2+cz2),  2Z?y  = 
2y  (ax2  +  by 2  +  cz 2),  2cz  =  Iziax2  +  by 2  +  cz2).  Critical  points  are  (0,  0,  0), 
(±1,  0,  0),  (0,  ±1,  0),  (0,  0,  =bl)  since  if  (x,  y,  z)  is  a  critical  point,  v  ^  0 
and  y  /  0  then  lax /2by  =  lx /2y  axy  =  bxy  =>►  a  =  b  contradiction. 
Similarly  all  critical  points  can  have  only  one  non-zero  component  and  this  must 
be  1.  Local  minimum  at  (0,  0,  0),  local  maxima  at  (±1,  0,  0),  saddle  points  at 

(0,  dil,  0)  and  (0,  0,  ±1).  Since  2exl  >  x4,  x2e~x~  — >  0  as  i  — >  ±oo 
and  hence  f(x,y,z)  — >  0  if  any  one  of  x,  y,  z  — >  ±oo.  Use  the  method 
in  Example  3.3  to  show  that  f  has  an  absolute  maximum  over  M3  at  a  critical 
point.  Absolute  maximum  a /e. 
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4.3  Local  minimum  at  (1/4,  1/4,  1/4),  degenerate  critical  points  at  (v,0,  0), 
(0,  y,  0),  (0,  0,  z),  (0,  y,  1  -  y),  (x,  1  -  x,  0),  (x,  0,  1  -  x),  x,  y,  z  e  R. 

4.4  The  function  is  only  defined  when  x  /  0  j  /  0  and  z  ^  0.  At  a  critical 
point  2x 3  =  y3  +  z3  =>  3v3  =  v3  +  y3  +  z3.  Similarly  2y3  =  v3  +  z3  and 
2z3  =  x3  +  z3  =>  x3  =  y3  =  z3  =>  x  =  y  =  z  and  (*){(*,  x,  x)  :  x  7^  0}  is  a 
set  of  critical  points. 

i  /-2  ■  *  \ 

Hf(x,x,x)  —  6  I  1  2  1  I 

V  1  1-2/ 

and  det(Hf(X'XtX))  =  0. 

4.5  Minimize  d 2  =  (x  +  l)2  +  (y  —  l)2  +  (vy  —  l)2.  Critical  points  satisfy 

x  +  1  +  (vy  —  l)y  =  0  and  y  —  1  +  (xy  —  l)x  =  0.  Adding  we  obtain 

x2y  +  y2x  —  xy(x  +  y)  =  0.  If  x  =  0  then  y  =  1  and  if  y  =  0  then  x  =  —  1. 
If  y  =  —x  =^v3  +  2v  +  l=  0=>v  <0  and  d 2  =  2(x  +  l)2  +  (v2  +  l)2  = 
x4  +  4(x  +  i)2  +  2  >  2.  Minimum  =  \/2.  Lagrange  multipliers  can  also  be 
used  for  this  problem. 

4.7  V(2X  1  ||X  -  y,  II2)  =2(mX- zr=l  Yi). 

4.8  2v  +  600*  —2 y  —  2y<px  =  0,  4y  +  600^  —  2x  —  2 0  —  2y<j)y  =  0  and 

( fix  =  (j)y  =  0  =>  x  =  y  =  (/)=>  x  =  ±1.  Critical  points  of  0  at  ±(1,  1), 

det(//0(i,i))  =  det(fl^(_i,_i))  =  1/4.  0^(1,  1)  <  0,  local  maximum  at 
(1,  1),  <fixx(—  1,  —1)  >  0,  local  minimum  at  (— 1,  —1).  The  equation  of  the 
level  set  yields  two  solutions;  0i(v,  y)  =  (y  +  a/6  —  3x2  —  5 y2  +  6vy)/3, 

02  (v,  y)  =  (y  —  -y/6  —  3v2  —  5y2  +  6xy)/3.  Both  have  a  local  maximum  at 
(1,1)  and  both  have  a  local  minimum  (—1,  —1). 

4.10  \x\  <  2  and  |y  |  <  2  since  x2  +  y2  =  2  and  |z|  =  1 1  —  v|<l  +  |v|<3  and 
F_1(2,  1)  is  bounded  and  hence  compact  since  it  is  clearly  closed. 
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Chapter  5 


5.1  P'{t)  =  (6sinh3f,  — 6cosh3f,  6),  ||/>,(0||  =  6V2cosh3t,  length 

=  2  sinh  15. 

5.2  (c)  On  [0,  1],  (cos-1^))'  =  -(1  -  s2)“1/2,  P'(s)  =  (-(1  -  s2)1/2,  -s,  0). 

5.3  s(t)  =  tVr2  +  h2,  P(s~Ht))  =  (r  cos  (-7^=)^  sin  ^4=)- 

5.4  (a)  s(t)  =  \/3(U  —  1),  s~l(t)  =  log(-^  +  1).  Unit  speed  parametrization  t  — > 

+  1)  (cosdogC-^  +  1)),  sin  (log  (-^  +  1)),  l)  for  0  <  t  <  V3(e  -  1). 
(b)  =  sinh_1(t/V2).  Unit  speed  parametrization 


5.5 

5.6 


(( 


t2\l/2 

1  +  t) 


’  V2 


,  sinh 


where  0  <  t  <  V2sinh(l) 
Let  P(t)  —  ^ 


for  0  <  t  <  4. 


We  need  0:  [0,  1]  -*  [0,  1],  0'(O  >  0  for  0  <  t  <  1,  0(0)  =  0,  0(1)  =  1, 
0'(O)  =  0,  0'(1)  =  0.  Take  0(0  =  3t2  -  2t3. 


P{t)  = 


(1,  2)  +  (3t2  —  2t3)(— 2,  -4), 

•  (-1,  -2)  +  (3 (t  -  l)2  -  2 (t  -  1)3)(5,  2), 
.  (4,  0)  +  (30  -  2)2  -  2 (t  -  2)3)(— 3,  2), 


0  <  t  <  1 

1  <  t  <  2 

2  <  t  <  3. 


5.8  This  exercise  shows  that  the  rate  of  change  of  /  at  Xo  along  two  curves,  which 
pass  through  Xo,  depends  only  on  the  tangents  to  the  curves  at  Xo. 

5.9  To  show  (T X,  TY)  =  (X,  Y)  expand  ||T(X  +  T)||2  and  ||T(X  -  T)||2  and 
consider  the  difference.  Use  exercise  2.12  to  show  that  angles  are  preserved.  To 
show  that  area  is  preserved  it  suffices  (?)  to  show  that  the  area  of  rectangles  is 
preserved.  (Hint:  think  of  Riemann  sums.) 
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Chapter  6 

6.1  (a )  F(P(t))-  P'(t)  =  t,  27r2;(b)  P'(t)  =  t2  +  2t7  +  3t6,  85/84;  (c) 

F(P(t))  •  P'(t)  =  (cose1 ,  e,  er )  •  (0,  1,  er)  =  e  +  e2t ,  (£8  +  8^  —  l)/2. 

6.2  (a)  v2  +  yz  +  exz ,  (b)  does  not  have  a  potential,  (c)  vy  +  z2  +  sin(vyz),  (d) 
sinO2  +  yz). 

6.4  (a)  Use  Exercise  1.10,  (— 2y-\-z,  2x  —  3 z,  —  v  +  3y)/||X||3;  (b)  (xz2  —  xy2,  yx2  — 
yz2,zy2-zx2)/ ||X||3;(c)  (0,0,0). 

6.6  This  exercise  shows  that  div  and  curl  operate  like  derivatives.  In  (b)  the  minus 
sign  is,  perhaps,  unexpected.  A  very  careful  application  of  the  definitions  is 
needed  to  verify  these  formulae. 

6.9  Use  Exercise  6.7,  V(/(||X||))  =  V(f(Zx2  +  y2  +  z2))  =  f(\\X\\)X/\\X\\  and 

vI(/(»X!»  =  |  (/Wll>  ■  pj)  +  |  (/'CIIJflD  ■  |ij) 

+ 1  -m) 

=/"(iixi)j2+;y2,+" + /'(iixiD  • 2(j2 u;2 + ;2) . 

Then  g  harmonic  V2(/(||X||))  =  0<s-  2  f'(r)/r  +  f"(r)  =  0  r2  f'(r)  + 
2 rf(r)  =  (r2  f(r ))'  =  0  f(r)  =  C/r 2  &  f(r)  =  B  +  ( A/r ). 
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Chapter  7 


7.1  P(t)  =  (3cosL  9sin0, 0  <  t  <  27r,  ellipse,  Curvature  27(9sin2^  + 
81  cos2  t)~ 3//2,  maximum  curvature  at  (0,  ±9). 


7.2  |ft 


^_/2j\2  2  f2x  \  2 

a2^  b2  ^  blKa2) 


ab 


+ 


^2\3/2’ 

) 


If  the  ellipse  has  an  anti¬ 


clockwise  orientation  then  \k\  =  k,  otherwise  k  =  — 1^|. 

7.4  Let  P(t)  =  (x(t),  y(t),  z(t))  be  unit  speed.  P'(t)  =  (xf (t) ,  y' (t) ,  z' (0)  is 
independent  of  t  O  all  tangents  are  parallel  o  P(t)  =  (a  +  bt,  c  -\-  dt,e  + 
ft). 

7.5  (a)  T(t)  =  ((1  +  01/2,-(l  -  01/2,  V2)/2,  N(t)  =  ((1  -  f)1/2,  (1  + 
01/2,0)/V2,  Bit)  =  (-(1  +  f)1/2,  (1  -  01/2,  V2)/2,  nit)  =  Tit)  = 
V2/4vT^72.  (b)  Tit)  =  (-(1  -  t2)1'2,  -t,  0),  Nit)  =  it,  -(1  -  f2)1/2,  0), 
Bit)  =  (0,0,1),  K(t)  =  1/a/T  —  /2,  r(/)  =  0.  (c)  r(f)  =  0(1  + 
/2)_1/2, 2,  (1  +  ?2)-1/2)/V5,  Nit)  =  ((1  +  /2)_1/2, 0,  -til  +  f2)”1/2), 
Bit)  =  (—2/(1  +  f2r1/2,  1,  -2(1  +  t2rl'2)/V5,  Kit)  =  1/75(1  +  t2), 
rit)  =  2/V5(l  +  t2).  (The  point  of  this  exercise  was  to  calculate  the  Frenet- 
Serret  apparatus  directly.  The  curve  in  (b)  lies  in  the  vy -plane.  By  Proposition 
8.1,  r(t)  =  0  and  B(t)  =  ±(0,0,  1).  It  is  necessary  to  calculate  B(t)  in  order 
to  check  which  sign  to  take  but  the  above  advance  information  is  useful  in  order 
to  know  what  to  expect.) 

7.6  ||  P'{t)  ||  =  a  sec  a.  Let  ab  =  cos  a.  Then  2(0  :=  ( a  cos  bt,  a  sin  bt,  t  sin  a) 
is  unit  speed.  T\t)  =  (— ab2  cos  bt,  —  ab2  sinr,  0),  tv(t)  =  ab2,  N(t)  = 
(—  cos  bt,  —  sin  bt,  0),  N(0)  =  (— 1,  0,  0),  and  normal  points  into  cylinder, 
hence  centre  of  curvature  =  2(0  ±  K,(t)N(t)  =  2(0  ±  T'(t)  =  (( a  — 
ab2)  cos  bt,  ia  —  ab2)  sin  bt,  t  sin  a),  a  helix,  on  the  cylinder  x2  -\-y2  =  a2 (l  — 
b2)2. 


7.7  2(0  =  (a  cos(^  sinh  l(^)),a  sin(|  sinh  l(^)),  *Jb2  ±  t2)  is  a  unit  speed 
parametrization.  Osculating  plane  at  2(0  \=  ( q\(t ),  q2(t),  q^(t))  is  perpen¬ 
dicular  to  Q\t)  x  Q"(t)  and  tangent  plane  at  2(0  is  perpendicular  to  Q(t)  := 
(qi(t),q2(t),0).  It  suffices  to  show  [g'(0  x  Q"(0  •  g(0] /IIG"(0II  •  112(011] 
=  a(t)  is  constant.  ||  2(011  =  cl,  Q\t)  x  Q"(t)  •  2(0  =  ab/(b2  ±  t2), 
||  2"(0II  =  b\/ a2  ±  b2 /a(t2  ±  b2)  and  a(t)  =  a/Va 2  ±  b2.  Alternatively,  if 
P(t)  =  ( a  cos  t,  a  sin  t,  b  cosh  y)  and  P(t)  =  (< a  cos  t,  a  snU,  0)  then  results 

in  Chap.  8  show  it  suffices  to  prove  [P'(t)  x  P"(t)  •  P(t)]  ||P'(0  x  P"(t)\\  • 


||P(0ll  is  constant.  ||P(0ll  =  a,  P'(t)  x  P"(t)  •  P(t)  =  [a4  cosh (at/b)\/b, 
and 


P\t)  x  P"(0H  =  [a2 (a2  +b2)l/2cosh(at/b)]/b. 


7.11  2  (0  =  B(t)  implies  g  is  unit  speed  and  Qf  =  Tf  =  B,  Q "  =  T~  =  Bf  = 
—tN  =  tif  A/f .  Hence  tvf  =  r  (since  r  >  0  and  Kf  is  positive)  and  Nf  =  —N. 
By  =  Tf  x  Nf  =  B  x  (-N)  =  T ,  B~  =  —  if  Nf  =  T'  =  kN.  Hence  Tf  =  k. 

7.12  X  =  rT  +  kB. 


Solutions 


241 


Chapter  8 


8.1  By  inspection  one  can  take  u  =  (0,  0,  1)  in  (a)  and  (b)  and  u  =  (0,  1,  0)  in  (c). 

8.2  T{t)  =  (1,  -t~2,  -r2  -  l)/(2  +  2 1~2  +  2 f~4)1/2,  N(t)  =  (1  +  2 1~2,  2  + 
t~2, 1  — r-2)/(6+6f_2+6r-4)1/2,  B{t)  =  (1,  -1,  1)/77k( t)  =  2f3V3(2+ 
2t~2  +  2r-4)-3/2,  r{t)  =  0. 

Note  By  inspection  P(t)-(  1 ,  — 1,  1)  =  —  1  so  T  lies  in  the  plane  v  —  y  +  z  =  —  1. 
By  Proposition  8.1,  r(t)  =  0  and  B(t )  =  ±(1,  —1,  1)/V3.  To  know  which  sign 
to  take  it  was  necessary  to  do  the  above  calculations. 

8.3  P'(0)  =  (—  tan  0,  cot  6,  \/2),  ||P/(0)||  =  P"{6)  =  (—  sec2  0,  —cosec 20, 

0),  ||  P'(0)  x  P"(0)  ||  =  4V2/(sin226>),  k(0)  =  sin2<9/V2. 

8.4  P\t)  =  3(2 1,  1  -t2,  1  +t 2),  ||P'(0II  =  372(1  +t2),  T(t)  =  (f^.  7^77 

r(0  •  (0,  0,  ±1)  =  ±-^=.  P'(t)  X  P"(f)  =  18(2/,  1  -  /2, -1  -  t2), 

\\P’(t)  X  P"(t)\\  =  1872(1+7),  K(0  =  1/3(1  + 12)2,  P"'(t )  =  6(0,  -1,  1), 
(P"\t),  P'(t )  x  P"(t))  =  -18  •  6  •  2,  r(f)  =  -1/3(1  +  t2)2. 

8.5  K(t)  =  (72/3)e“r,  r(t)  =  (l/3)e_r. 

8.6  The  normal  plane  at  P(t)  is  {X  :  (P(f)  —  X,  P'(t)}  =  0}.  If  Xo  lies  in  every 
normal  plane  then  (P(t)  —  X o,  Pf(t))  =  0  for  all  t.  Hence  ^(P(t)  —  X o,  P(t)  — 
X0)  =  0  and  ||  P{t)  —  Xo||  is  independent  of  t,  and  the  curve  lies  on  a  sphere. 
(P(6),  Pr(0))  =  —2  sin  26,  {(a,  b,  c),  Pr{6))  =  2 a  sin  26-\-2b  sin  6  +  2c  cos  26. 
Hence  a  =  —1,  b  =  0  and  c  =  0  imply  (P(6)  —  ( a ,  b,  c),  P\0))  =  0.  Centre 
(—1,  0,  0),  radius  2. 

8.7  P\t)  =  (a,2bt,3t2),  \\P'(t)\\2  =  a2  +  4 b2t2  +  9 t4,  P"(t)  =  (0,2 b,6t), 
P'(t )  x  P"(f)  =  (6^2,  -6at,  2 ab),  \\P\t )  x  P"(f)||2  =  36 b2t4  +  3 6a2t2  + 
4 a2b2,  P"\t )  =  (0,  0,  6),  P'"(f)  •  P'(f)  x  P"(f)  =  12a/?, 

r(0/«(0  =  \\P'(t)f(P'"(t)  ■  P\t)  x  P"{t))/\\P'{t)  x  />"(f)H3 

=  I2ab(a2  +  4  b2t2  +  9f4)3/2 /(36b2t4  +  36  a2t2  +  4a2b2)3/2. 

P  parametrizes  a  generalised  helix  a2  +  4 b2t2  +  9/ 4  =  a  (3  6b2t4  +  3  6a2t2  + 
4 a2b2)  for  some  a  e  R  O  a  =  1  /4 b2  and  4b2  =  a36a2  4/?4  =  9a2. 
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Chapter  9 


9.1  Jq1  x2dx  ■  f”/4  sin2  ydy  =  [x3/3]l0[x/2  -  (sin  2x)/4]q/4  =  (t r  -  2)/24. 

9.2  (a)  Jq  {fg  x  cos(x  +  y)dy}  dx  = 

(b)  (x2  +  y2)dy}dx  =  8/3  f0l(l  -  x2)l/2(2  +  x2)dx  = 

37r/2;  or  alternatively,  let  v  =  r  cos  0,  y  =  1  +  r  sin0,  dxdy  =  rdrdO 

and  Jq71^2  +  1  +  2r  sinO)rdrdO  =  3n/2 , 

(0  fiili/y  ^dx)dy  =  fi  (~y  +  y3)  dy  =  9/4. 

9.3  /02  {fyl~y2  dx\dy  =  8/3. 

9.4  Let  F  =  (Q,  —  P).  If  t  — >  (x(0,  y(0)  is  a  unit  speed  parametrization  then 
n  =  (yr,  —x').  Now  apply  Green’s  Theorem. 

9.5  (a)  J2  f03y  ey2dx J  dy  =  3(c4  -  l)/2, 

cV//*dyJ  dx  =  e2  —  1. 

9.6  JJ  (1  —  xy)dx  dy  =  /J  f(^ 1  A  (1  —  vy)dy|dv  =  (27r  —  l)/8.  The 

*2+;y2<i 

*>(),}’>() 

calculations  are  easier  using  polar  coordinates. 

9.7  20tt. 

9.8  (a)  Once  Green’s  theorem  has  been  used,  symmetry  implies  that  the  answer  is 

0. 

(b)  By  Green’s  theorem  Jr  =  JJ  2 xdx  dy.  This  can  be  evaluated  in 

(x—2)2+y2<4 

the  usual  fashion  but  some  geometry  avoids  all  the  calculations: 


(b)  /r 


0 


2(x  —  2 )dx  dy  +  4 


dx  dy  =  0  +  47T.4  =  167r. 


(x— l)2-\-y2<4 


(x— 2)2+y2<4 


The  first  integral  is  zero  since  x  —  2  has  average  value  0  on  the  disc  and  the 
second  integral  is  4x  (area  of  disc). 

(c)  Jr  =  JJ (—3 y  —  4 x2y)dx  dy  =  J3  dx  •  J*  —3ydy  —  J 3  4 x2dx  •  J*  ydy  = 
Q 

-1025. 
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Chapter  10 

10.1  (a)  (u,  v)  —>  {a  cosh  u  cos  v,  b  cosh  u  sin  v,  c  sinh  m),  — oo  <  u  <  +00,  0  < 

v  <  27 r. 

(b)  (u,  v)  — >►  (a  cosh  u,  b  sinh  u  cos  v,  c  sinh  u  sin  u),  — 00  <  u  <  +00,  0  < 
v  <  27 r; 

10.2  (a)  P\  is  easily  seen  to  be  injective.  For  the  remainder  see  Example  13.5. 

(b)  (P2)x  =  (cos  9,  0,  1),  (P2)o  =  (-x  sin  0 ,  y  cos  0,  0)  and 

(Pi)x  X  (P2)e  =  {-y  cos,  -x  sin,  y  cos2  0). 

If  (Pl)x  x  (Pi)q  =  0  then  cos#  =  sin#  =  0.  This  is  impossible  since 
cos2  6  +  sin2  6  =  1.  If  P2&,  0)  =  P2 (x\,  9\)  then  x  -\-  a  =  x\  +  a  and 
cos  9  =  cos 9\ ,  sin 9  =  sin 9\ .  Hence  x  =  x\  and  9  =  9\. 

(c)  (P3)u  x  (P3)v  =  (u  +  v,  v  -  u,-2)  ^  (0,0,0),  P3(u i,vi)  = 
P3(U2,  V2)  =>  Ui  +  V\  =  U2  +  V2  and  U\  —  V\  =  U2  —  V2  =>  u\  —  U2  = 
v>2  —  V\  =  —  (i>2  -  r’l)  =>  ni  =  V2  and  u\  =  U2. 

10.3  Parametrize  the  ellipsoid  (( x/Jltr )2  +  (y/^/2tr)2  +  (z/V2th)2  =  1  using 
ellipsoidal  polar  coordinates  and  take  the  9  =  it /A  cross  section. 

10.4  Parametrization  formula  unchanged.  Range  0  <  9  <  7r/2,  0  <  ip  <  7t/2. 

10.5  ||(0,  0,  1)  +  t((u,  v ,  0)  -  (0,  0,  1))||2  =  1  =>  t2u2  +  t2v2  +  (1  -  t)2  =  1. 

Hence  t  =  0  or  t  =  2/(1  +  u2  +  v2),  v)  =  (2u/(l  +  u2  +  u2),  2u/ 
(1  +  u2  +  v2) ,  1  -  2/(1  +  m2  +  v2)).  v\)  =  (j)(u2,  V2)  =>  m2  +  v\  = 

m2  +  ui  (from  third  coordinate)  =>-  mi  =  112  and  ui  =  V2  from  the  first  and 
second  coordinates  and  <fi  is  injective.  (j)u  x  <fiv  =  (— 2m(1  +  u 2  +  u2)-2, 
—2v(l  +  u2  +  v2)-2 ,  (1  -  ( u 2  +  i72 ) 2 ) ( 1  +  u2  +  u2)-2)  ^  (0,  0,  0)  for  all 
(m,  u)  g  M2.  Also  0(M2)  =  S\( 0,  0,  1). 

10.6  Let  P(m)  =  (m  cos  m,  m  sin  m,  mV 3),  0  <  m  <  2,  Length  =  JQ2(4  +  u2)l^2du  = 
2(V2  +  sinh_1(l))-  (Use  u  =  2  sinh  6,  sinh-1(l)  =  log(l  +  V2)  and 
sinh(2 sinh_1(l))  =  f(l  +  V2)2  -  (1  +  V2)~2)  /2  =  2^2. 

10.7  V/(v,  y,  z)  =  (1  +  y,  x  +  z,  y)  ^  (0,  0,  0)  since  either  y  ^  —  1  or  y  ^  0. 
t/0  =  {(x,  y,  z)  :  y  ^  0}  and  4> o(x,  y)  =  (x,  y,  l~yxy),  (x,  j)  e  K2  \ 

{(x,  y);  y  #  0}.  U\  =  {(x,  y,z)  e  S,y  ^  -1}  and  4>i(y,z)  = 

for  (y,  z)  e  K2  \  { (v ,  z);  y  ^  —  1}.  (f/o,  0o)  and  (t/i,  0i)  are  graphs  and  hence 
parametrized  surfaces. 

10.8  At  time  t,  L  is  at  units  above  the  vy-plane.  The  (x,  y)  coordinates  of  a  point 
u  units  along  L  are  ( u  sin  bt,  u  cos  bt)  after  time  t. 
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Solutions 


Chapter  11 


11.1  P(r,9)  =  (r  cos  0,  r  sin  6,  r2),  0<r<2,  0  <  0  <  27 r,  EG  —  F2  = 
r2(  1  +  4r2),  surface  area  =  7t(173//2  —  l)/6. 

11.2  (a)  P(x,  6)  =  (x,  x 3  cos  0,  v3  sin  6),  0  <  x  <  1,0  <  6  <  2n,  EG  —  F2  = 

x6(l  +  9x4).  Use  substitution  u  =  1  +  9x4,  surface  area  =  7r(103/2  — 

D/27. 

(b)  P(x,  6)  =  (x,  x2  cos  0 ,  x2  sin  0),  0  <  x  <  1,0  <  0  <  2n,  EG  —  F2  — 
x4(l  +  4x2).  Surface  area  =  Jq27F  x2 VT  +  4x2dx  =  ^  J0sinh  ^ 

sinh2(0)  cosh2(0)<i0  =  ^  J0smh  ^(cosh(40)  —  1  )d6  =  7r(18\/5  — 

log(2  +  a/5))/32  (use  substitution  2x  =  sinh  0  and  identity  sinh_1(v)  = 
log(v  +  \/  X2  +  1)). 

11.3  E  =  1,  F  =  0,  G  =  r2  +  1,  surface  area  =  JQ27r  jJ  Vl  +  r2  Jr J0  = 

271-  JQsmh  cosh2  (j> dcj)  =  n(V2  +  log(l  +  \/2)).  Substitution  r  =  sinh  </>. 

11.4  If  P(*,  y)  =  (*,  y,  /(*,  y))  then  E  =  1  +  /2,  F  =  /*/y  and  G  =  1  +  f2. 
Hence  EG  -  E2  =  { 1  =  f2){  1  +  /2)  -  f2f2  =  1  +  /x2  +  /2  =  1  +  || V/||2. 

11.5  Use  parametrization  P(r,  0)  =  (r  cos0,  r  sin0,  r2/ 3),  0  <  0  <  27t,  0  <  r  < 

\/3.  Surface  area  =  J02?r  rj  1  +  (4r2/9)  dr<70  =  (7\/2T  —  9)n/6. 

11.6  Using  P(x,y)  =  (x,  y,  vy).  By  Exercise  11.4,  surface  Area  =  jj  (1  + 

x2+y2<a2 

v2  +  y2)^2<ix  dy.  Change  to  polar  coordinates,  (v,  y)  — >  (r  cos  0,  r  sin  6). 

2 

Alternatively,  let  P\(r,  6)  =  ( rcos0 ,  r  sin0,  r-j  sin  20),  0  <  0  <  27t,  0  <  r  <  a. 
Then  E  =  1  +  r2  sin2  20,  F  =  y  sin 40,  G  =  r2(  1  +r2  cos2  20),  EG  —  F2  — 

r2  +  r4.  Surface  area  =  /  rv  1  +  r2drd0  =  27t((1  +  a2)3^2  —  l)/3 

7tf=0  7r=0 

(substitution  w  =  1  +  r2). 

11.7  First  octant  =^O<0<7t/2,  between  the  planes  =>>  0  <  r  <  Bn/3,  in¬ 
side  cylinder  =>-  r2  cos2  0  <  r2/4  =>►  cos0  <  \  =>  tt/3  <  0  <  7t/2.  E  = 

(r2/64)  +  l,  E  =  0,  G  =  r2 .  Surface  area  =  J^/2  f^^3  r^/l  +  (r2 /64)  drdO  = 
224tt/9. 
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Chapter  12 

12.1  S  c  {(jc,  y,  z)  :6x  +  3y  +  2z  =  6},  n(p)  =  (6,  3,  2)/7,  F  •  n  =  ff  F  =  f 

s 

ff  V EG  —  F2dx  dy  =  -y  (Area  S)  =  9.  Area  of  triangle  in  M3  with  vertices 

s 

at  a,  b  and  c  is  \  ||  (a  —  b)  x  (a  —  c)||.  Alternatively,  using  projections  onto 
coordinate  planes,  ff  F  =  ff(f,g,h )  =  fff+f[g+f[h  =  l.±.2.3  + 

S  S  PyZ  PZx  Pxy 

2.^.1. 3  +  3. \.1.2  =  9.  Also,  P(x,  y)  =  (. x ,  y,  (6  —  6v  —  3y)/2),  (v,  y)  e 
triangle  A  in  vy-plane  with  vertices  (0,  0),  (1,  0)  and  (0,  2)  parameterizes  S 

and  Px  x  Py  =  (3,  3/2,  1)  has  the  correct  orientation,  ff  F  =  ff  (F,  Px  x 

s  A 

Py)dxdy  =  9(AreaA)  =  9. 

12.3  cp(u ,  u)  =  (u,  u,  wu3),  <pu  =  (1,  0,  u3),  (j)v  =  (1,  0,  3wu2)  and  cpu  x  (pv  = 
(—v3,  —3uv2,  1).  Hence F(c/)(w,  u))  -<pu(u,  v )  x<pv(u,  v )  =  2+  18w2u4  +  2u9. 
Answer  467. 

12.4  P(0,  -0)  =  ((£>  +  a  cos  0)  cos  ip,  (b  +  a  cos  0)  sin  ip,  a  sin  6),  0  <  0,  ip  <  2ir. 
This  surface  is  a  torus  (see  Table  11.1).  F (P(6,ip))  =  ( a  cos  9  cos  ip,  a  cos  0 

sin  ip,  a  sin  6).  (F,  Pq  x  P^)  =  —a2(b  +  a  cos  6),  ff  F  =  —4n2a2b.  Since 

s 

Pq  x  P^( 0,  0)  =  —  a(b  +  a) (l,  0,  0)  points  inwards,  P  is  not  consistent  with 
the  orientation.  Answer  =  —  (— 4n2a2b). 

12.5  Use  spherical  polar  coordinates  (see  Table  11.1)  with  range  0  <  9  <  7r/2, 
0  <  ip  <  2i r.  EG  —  F2  =  a4  sin2  6. 

(a)  y2  +  z2  =  a 2  sin2  0  sin2  ip  +  a2  cos2  0,  4na4/3 

(b)  (EG  —  F2)1//2(v2  +  y2  +  (z  -\-a)2)~1^2  =  a  sin  6/ (2  +  2  cos  0)1^2,  2zra (2  — 

V2). 

12.6  Truncated  cone,  f(r,6)  =  (r  cos  6,  r  sin  6,  r),  0  <  0  <  2tt,  1  <  r  <  3, 
fr  *  fo  =  (~r  cos  0,  -r  sin  0,  r),  (F (f(r,  0)),  fr  x  fe)  =  2 r,  16t r. 

12.7  Fr  x  Fq  =  (sin0,  —  cos0,  r).  (G (E(r,  0)),  Fr  x  Fq)  =  •  •  •  =  —r0  /  (r2  + 

02)y2,  ff  G  =  f**{f°(-rO  /  (r2  +  02)2!2)dr}d0  =  tt(  3  -  VT0)/V2. 

s 

12.8  Method  (b).  Use  polar  coordinates  on  each  coordinate  plane  and  z  =  0  on  the 

vy— plane.  ff  (y2  +  z2)dy  dz  +  ff  tan ~l(x/z)dzdx  + 

Pyz(®  A  AS) 

ff  0ex2+y2dxdy=  ff  r2rdr  d0+  ff  Ordr  dt  = 

PXy(  S)  0<r«3,0<6,<7r/2  0<r<<3,0<(9<7r/2 

a2n(2a 2  +  tt)  / 16. 
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Solutions 


Chapter  13 


13.1  (a)  curl(zi  -  xk)  =  (0,  2, 0),  (3tt  -  8)2a2/3. 

(b)  curl(— yi  +  xj)  =  (0,  0,  2),  w a2. 

(c)  curl(— zj  +  yk)  =  (2,  0,  0),  8a2/3. 


_  2  _  2  \ 

~r  sin  0,  cos  9,  r  1,  curl(y,  z,  x) 


a 


13.2  Pr  X  Pg  =  ( 

P$  ■  (— 1,  —  1,  -1)  =  £(si n0  +  cos  9)  -  r.  fr 


a 

.2 


fo  Ion^^in0  +  cos^)  —  J  dr  =  —irb2. 


=  (-1,-1,-1),  Pr  x 
ydx  +  zdy  +  xdz  = 


13.3  The  mapping  6  — >  (b  -\-b  cos  0,b  —  b  cos  9,  \f2b  sin  0),  0  <  9  <  2n,  para¬ 
metrizes  T  a  circle  of  radius  \[2b  containing  the  points  (2b,  0,0 ),(b,b,  \f2b), 
(0,  2b,  0)  and  (b,b,  —\[2b).  Since  curl (y,z,x)  =  (— 1,  — 1,  — 1)  and  n  • 
(—1,  —1,  —1)  =  —yfl,  fr  ydx  +  zdy  +  xdz  =  — V2(Area  circle)=  —\[2 


7t(V2  b)2  =  —  2\f2rxb2 . 

13.4  T  coils  around  S  n  times.  fr  —ydx  -\-xdy  =  (b  + a  cos  nt)2dt  =  n(2 b2  + 

a2).  Area=  Jq1  JQ27r  r(b  +  a  cos  nt)2drdt  =  7 r(b2  +  ^-). 

13.5  If  G  is  the  vector  field  in  Example  6.6  then  curl(G)  =  F.  For  d  >  0  let 
Tj  =  {(v,  y,z)  :  x2  +  y2  =  1  —  d2}.  Then  T^  U  Fc  suitably  oriented  is 
the  boundary  of  S&>c  and  P(0)  =  ( Vl  —  d2  cos  0,  Vl  —  d2  sin  0,  Vl  —  d2), 
0  <  0  <  27 r,  parametrizes  Tj  so  that  the  sphere  above  the  plane  z  =  d  is 
on  the  left  as  rd  is  rotated.  Hence  G(E(60)  •  P'(0)  =  (sin9,  —  cos  0,0)  • 
Vl  -  d2(-  sin  0,  cos  0,  0)  =  -Vl  -  d2  and  fr/ G  =  (-Vl  -  d2)  •  l(Fd)  = 


— V 1  —  d2 27tV  1  —  d2  =  —2tt(\  —  d2).  By  Stokes’  Theorem 


S  b 


,C 


j  G  =  27 r(c2 


r* 


13.6  P(r,  0)  =  (r  cos  0,  r  sin  0,  b(  1  —  rcons0),  0  <  r  <  a,0  <  9  <  2tt  parametrises 
the  portion  S  of  the  plane  inside  the  cylinder.  Pr  x  Pq  =  (lor la,  0,  r),  curl(y  — 

z, z-x, x-y)  =  (-2,  -2,  —2),ff  (curl (y-z,  z-x,  x-y),  Prx  Pg)dr  dO  = 

v 

—27 xa(a  +  b).  Orientation  inconsistent  with  positive  answer.  Choose  opposite 
orientation.  Answer  27 va(a  +  b). 


Solutions 
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Chapter  14 


14.1 

14.2 


14.3 

14.4 


P(r,  6 ,  z)  =  (r  cos  0,  a  +  r  sin  6,  z),  0  <  r  <  a,  0  <  6  <  2tt ,  0  <  z  < 
(a2  +  r2  +  2 ar  sin  6) /4a ,  | =  r- 

/(r,  0,  z)  =  (r  cos  0,  r  sin  6,  z),  0  <  9  <  n,  0  <  z  <  r,  0  <  r  <2 a  sin  6, 
volumes  jo*™6  rdz)  dr]  dO  =  8a3 /3  sin 0(1  —  cos 2  0)d0  = 

32a3 /9.  Note  that  the  parametrization  in  Exercise  14.1  leads  to  an  unsuitable 
integral.  Use  Fig.  14.13. 

Use  (r,  6,  z)  — >  (r  cos  0,  r  sin  0,  z),  0  <  r  <  1,  0  <  0  <  27t,  0  <  z  <  27r. 
Paraboloid  and  cone  intersect  when  x2  +  y2  =  z2  =  2  —  z,z2-\~z  —  2  = 
0,  z  =  1,  z  =  —2.  Above  plane  z  =  0,  hence  z  =  1.  Region  projects  onto  the 
disc  {(x,  y,  1)  :  x2  +  y2  <  1}  with  paraboloid  above  and  cone  below.  Para¬ 
metrization  (r,  6,  z)  — >  (r  cos  6,  r  sin  0,  z),  0  <  r  <  1,0  <  0  <  2tt,  r  < 


14.5 


M 


d0=5f 


14.7 

14.8 

14.9 


z  <2  -  r2.  Volume  =  Jq1  f2  r  rdz 

(a)  87 r; 

(b)  877; 

(n\  8 1 7r  216 

tc;  4  5  • 

2/3. 

V  =  /ot/ot/o14"^  7  dx}dy}dx  =  4(9 V?>  —  8\/2  +  1)/ 15.  By  symmetry 
the  second  volume  is  8  V. 

'2a 


(a)  Volume  =  A (z)dz  =  f^a  n(z2/3)dz  +  f^a  n(4a2  -  z2)dz  = 

(7ra3)/V3  +  ™3(y  —  3>/3)  =  ^j-(2  —  y/3).  (b)  Use  spherical  po¬ 
lar  coordinates  w  :  (r,  6,  ip)  — >  (r  sin  9  cost/,  r  sin  0  sin  ip,  r  cos  6),  0  < 
6  <  7r/6, 0  <  r  <  2a,  and  0  <  ip  <  2n.  Since  det(wr)  =  r2sin0, 

Volume=  f;/6  sin  OdO  •  r2dr  •  JQ27r  dip  =  ^j-(2  —  V3).  See  also  Exam¬ 
ple  15.3. 

i_ 

12’ 

det(Ff)  =  w2n,  xdx  dy  dz  =  f0  u3du  f0  v(l  —  v)dv  f0  dw  =  1/24, 

///  ^  =  /„■  /„'  /„'  <(»</»•  =  1/2. 


14.10 

14.11 
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Solutions 


Chapter  15 


15.2  div(F)  =  y2  +  x2.  Interior  of  S  =  {(r  cos  0,  r  sin  0,  z)  :  0  <  r  <  V3, 
tt/2  <  0  <  3tt/2,  0  <  Z  <  -r  cos  0},  div(F)  =  r2,  18^3/5. 

15.3  f0“(Cl  u\/q  ^  a  ^  xdz)dy)dx  =  a2cb/ 24. 

15.4  Order  of  integration  is  important,  first  x  and  then  y  to  get  ( e  —  2)/2e. 

15.5  div(x2,  —  y2,  3 xz)  =  5x  —  2 y. 


xdx  dy  dz  = 


ydx  dy  dz  =  7 r.  Solution  37 r. 


x,y,x>0  x,y,x>0 

x2+y2+x2<4  x2+y2+x2<  4 

15.8  Volume  =  ff  (ax2  +  by2)dx  dy  where  D  =  {(x,  y)  :  (. x/c )2  +  ( y/d )2  <  1}. 

D 

If  P(r,0 )  =  (rc  cos  6,  rd  sin  0),  0  <r<l,O<0<  2tt  then  dx  dy  = 
red  •  drdO.  Volume  =  (a c2r 2  cos2  0  +  bd2 r 2  sin2  0)rcd  •  dr  dO  = 

cd  fy  r2dr  •  (ac2  cos2  0  +  M2  sin2  0)d0  =  cdn(ac 2  +  bd2)/ 4. 
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Chapter  16 

16.1  E  =  1,  F  =  0,  G  =  t2  +  b2,  l  =  0,  m  =  -bit1  +  b2)~ll2, 

K  —  —b2  /  it2  +  b2)2. 

16.2  Since  the  surface  of  a  cone  always  lies  on  one  side  side  of  each  tangent  plane, 
K  >  0,  and  since  it  contains  a  line,  K  =  0  and  0  is  a  principal  curvature. 
P(r,0)  =  (r  cos  0,  r  sin  6,  r),  n  =  (—  cos  0,  —  sin  0,  l)/\/2,  E  =  2 ,F  = 
0,G  =  r2,  l  =  0,m  =  0,n  =  —r/y/2.  Principal  curvatures  0,  —  l/r\/2. 

16.3  Use  (16.6),  E  =  5,  F  =  4,  G  =  5,  n  =  (-2,  -2,  l)/3,  /  =  -2/3,  m  =  0, 
n  =  -2/3.  0M(1,  1)  =  (1,  0,  2),  Ml,  1)  =  (0,  1,  2),  vi  =  2,  n2  =  1, 
kp (v)  =  10/123. 

2  2 

16.4  By  (16.6),  Kp(v)  =  V+2u|  U2yy  "  =  a  (constant)  for  all  (t>i,  t>2)  <S>  i>?(/  — 

a£)  +  2nit;2(m  — a/7)  +  r’2(i2  — ctG)  =  0forall(ni,  M  122  =  aF 

and  12  =  aG. 

E  =  1  +y2,  F  =  xy,G  =  l  +  x2  and  /  =  0,  m  =  (1  +  v2  +  y2)~l/2,  n  =  0. 
At  an  umbilic  point  l  =  aE  implies  a  =  0  and  m  =  aF  implies  a  /  0. 
Hence  there  are  no  umbilic s. 

16.5  K  =  36a  n  •  (1  +  9a4  +  9n4)-2,  elliptic  points  uv  >  0,  hyperbolic  points  when 
uv  <  0. 

16.6  m  =  n  =  0^K  =  0.  This  is  the  cylinder  over  the  ellipse  v2  +  (y/2)2  =  1. 

16.7  u  — >  (an,  —bv,  0)  +  a(a,  b,  n),  n  — >  ( au ,  Z?a,  0)  +  n(a,  —  b,  u). 

16-8  K  =  ^2  (f?  +  £?  +  *?)  • 

16.9  It  suffices  to  note  that  the  coefficient  of  A  in  (16.9)  is  En  +  Gl  —  2Fm. 

P(x,  y )  =  (x,  y,  log  cosy  —  log  cos  x),  E  =  sec2v,  F  =  —  tan  v  tan  y,  G  = 
sec 2y,/  =  sec2v/ (1 +tan2  v+tan2  y)1//2,  m  =  0, 12  =  —sec 2y/ (1+tan2  v  + 
tan2  y)1/2. 
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Solutions 


Chapter  17 


17.1  Use  toroidal  polar  coordinates,  L(xPq  +  yP^)  =  \Pq  +  ;  Pu¬ 

ll. 2  cj)(u,v)  =  (u,v,u2  —  2v2),  <fru  =  (1,0,2 u),  (f)v  =  (0,  1,  — 4u), 


uu 


(0,  0,  2),  (puv  =  (0,0,0),  4>vv  =  (0,0, -4),  E  =  1  +  4m2,  F  =  -8 uv, 
G  =  1  +  16v2,  n  =  (-2u,4v,  1)/(1  +  4m 2  +  16V2)1/2,  l  =  2/(1  + 

4m2  +  16V2)1/2,  m  =  0,  n  =  —4/(1  +  4 u2  +  16v2)*/2.  k p  ( II ) 

[  cl  P(t )  =  t)  =  (\/2f,  t,  0).  By  the  chain 


2u^ — 4^2 


(4w2+16u2+l)||ui(/>M+i;2<M2 

rule  7^(0  =  \/2</>M  +(/>u,  and&p^CP'CO)  =  C(2(V2)2— 4)  =  0.  The  straight 
line  t  — >  P(t)  lies  in  S. 

17.3  Suppose  k  ^  0  and  ni  and  112,  the  normals  to  Si  and  S2,  are  linearly  indepen¬ 
dent.  If  ni  •  n2  =  cos  0,  then  ||n2  —  (ni,  n2)ni  ||2  =  sin2  6  and  sin  6  7^  0.  Let 
P  denote  a  unit  speed  parametrization  of  T  with  P(0)  =  p  and  P'(t)  =  T(t). 
If  N  is  the  normal  to  T  in  M3  then  T  _L  ni,  n2  and  N  e  span(ni,  n2).  Then 
DjT  =  P"(t)  =  ftN  and  A /  =  —  (Djn/,  T)  =  (n /,  DjT)  =  (n *,  acN).  Hence 


kN  =  (n/,  AcN)n,-  + 


(n2  —  ni  cos  6,  kN)  (n2  —  ni  cos  6) 


sin  0 


sin  0 


and  ft2 
2 


A2- 


-2Ai  A2  cos  0+A2  cos2  9  2  •  2 


=  \\-\-— - — ^ — 1 - ,  kt  sin  6  =  Aj  sin  0  +  2Ai  A2  cos  6 

+  Af  cos2  6  =  A2  +  A2  —  2Ai  A2  cos  6,  k  =  0  =>-  Ai  =  A2  =  0.  ni  and  n2  are 
not  linearly  independent  =>  ni  =  ±112  and  Ai  =  ±A2. 

17.4  P(l)  =  (1,  1,2)  and  P'(1)/||P'(1)||  =  (2,  1,6)/V4l  =  v,  5  =  *"‘(0) 
where  g(u,  v,  w)  =  u2  +  v2  -  w,  Vg/||  Vg||  =  ,  ]j^]f  coincides 

with  n  on  S  and  is  defined  on  an  open  set  containing  S.  Calculate  D ( Vg / 1|  Vg  || ) 
and  -Dy(Wg/\\Wg\\)  =  -D(Vg/\\Vg\\)  •  v. 


D( 


vg 

HVgll 


10  -8 

)(1,1,2)  =  ^  [  -8  10 

4  4 


0 

0  |,DV(^)(1,1,2) 

0 


9V4T 


(4,  -2,4), 


Mp)  =  3^j(4.  -2, 4)  •  (2,  1,  6)/V4l  =  10/123, 
17.5  Use  Euler’s  formula. 
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Chapter  18 


18.1  Cylinder  =  /  ^0),  f(x,y,z )  =  x2  +  y2  —  1,  V/  =  (2v,  2y,  0).  Hence 

||  (2cos(at  +  b),  2  sin(at  +  b),  0) ,  <p"  (t)  =  (a2  cos(at  +  b), 
a 2  sin  (at  +  b),  0). 

18.2  Unit  speed  parametrization  cj)(t)  =  ( x(t ),  y(t))  ==>  x" x'  +  y"y'  =  0.  Note 
that  p  is  also  unit  speed.  n(p(t))  ||  (y'(t),  —x'(t)cos0,  —x'(t)  sin0),  while 
<p"(t)  =  (x"(t),  y"(t)  cos  6,  y"(t)  sin  6)  =>►  p"(t)  =  a(t)n(<p(t))  where 
a(t)  =  x"(t)/y'(t)  if  y'(t)  7^  0  and  a(t)  =  y"(t)/x!  (t)  if  x'(t)  7^  0.  Since 
cp  is  a  unit  speed  geodesic  (18.5)  implies  p"(t)  =  T'(t)  =  K,n(t)n(cp(t)),  and 
the  normal  curvature  at  P(t,  Oo)  is  x"(t)/y'(t),  if  y'(t )  7^  0,  and  y"(t)/x'(t), 
if  x\t)  ^  0. 

18.3  If  P(t)  =  a  +  tb  parametrizes  the  straight  line  then  P  has  constant  speed  and, 

since  P" (t)  =  T'(t)  =  0,  (18.4)  implies  the  geodesic  curvature  is  zero  and, 
by  Proposition  18.2,  straight  lines  are  geodesics,  (vo,  yo>  £0)  +  t(a,  b,  c)  lies 
in  the  surface  z  =  x2  —  y2  (x 0  +  ta )2  —  (yo  +  tb)2  =  zo  +  tc  for  all 

t  (2xo a  —  2yob  —  c)t  +  ( a 2  —  b2)t2  =  0  for  all  t.  Letting  a  =  1,  b  = 

1 ,  2vo  —  2yo  =  c  and  a  =  1 ,  b  =  —  1 ,  2xo  +  2yo  =  c  gives  two  lines  on  the 
surface. 

18.4  Since  P  is  unit  speed  T (t)  =  P'(t)  and  K,n(t)  =  kp(t)(T (t)).  By  (18.3), 
rg  =  0  n'  =  —KnT  Lp(t)(T(t))  =  Kn(t)T(t)  T(t)  is  a  principal 
curvature  for  alU  T  is  a  line  of  curvature.  T  a  geodesic  =>►  tvg  =  0  =>► 
Tr  =  kN  =  Kn  n.  Hence  N  =  ±n,  B  =  TxN  =  Tx  (±n)  =  =pn5  and 

t N  —  B  =  =F(ny)  =  -F(^gi^)  =  ~FLf> =  and  t  =  Tg . 

18.5  By  Proposition  8.1,  V  lies  in  a  plane.  Using  the  proof  in  Example  8.2,  we 
see  that  the  non-zero  curvature  is  constant.  An  application  of  the  result  in 
Example  8.4  completes  the  proof. 

18.6  By  (18.5),  since  the  tangent  space  to  S  at  P(t)  is  2  dimensional  and  T(t)  _L 
N(t),  we  have  kp(t)(T (t))  =  0  <=►  N(t)\\ns(t)  <=►  N(P(t))  e  Ts(P(t )) 
<<=>►  Ts(P(t))  =  span  {T(t),  N(t)}  tangent  plane=  osculating  plane. 

18.7  Assume  sphere  has  centre  at  the  origin  and  radius  r.  Let  P  be  a  unit  speed 
parametrization  of  the  curve.  By  (16.5)  the  normal  curvature  is  ±l/r  and, 
by  (18.5),  Kg(t)  constant  -<=>►  K(t)  constant.  By  Example  8.2,  tv(t)  7^  0  and 
r(t)(P(t)  —  c,  B(t))  =  —  (pyp)'  —  0.  This  implies  r(0  =  0  and  Exercise  18.5 
implies  the  required  result. 
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Absolute  area,  136 
Absolute  curvature,  70 
Absolute  geodesic  curvature,  219 
Absolute  maximum,  43 
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Asymptotic  curve,  226 

Axis  of  rotation,  65 

Azimuth,  116 
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Basis,  26,  78 

orthonormal,  76,  213,  217 
unit  vector,  2 
Bijective  mapping,  106 
Binormal,  74 

Boundary,  3,  27,  121,  143,  149,  180,  185 
Bounded  set,  3 
Bugle  surface,  215 


C 

Calva,  225 
Catenary,  203 
Catenoid,  203 

Cauchy-Schwarz  inequality,  31,  34,  53,  87, 
91,  132 

Chain  rule,  5,  17 

Change  of  variables,  118,  167 

Circle  of  curvature,  72 


Clockwise  direction,  48 
Closed  curve,  48 
Closed  set,  3 
Colatitude,  116 
Compact  set,  3,  28 
Cone,  108,  196 

parametrization  of,  173 
Connected  set,  60 
Connected  surface,  219 
Constant  Gaussian  curvature,  209 
Constant  geodesic  curvature,  226 
Constraint,  25 
Continuous  function,  3,  94 
Convex  set,  43,  61 
Coordinate  curve,  111,  204,  208 
Coordinate  function,  26 
Coordinate  plane,  140 
Coordinate  system,  114,  169 
Coordinates 
Cartesian,  106 
curvilinear,  114 
cylindrical,  133,  173,  176 
elliptical,  173 
geographical,  109,  133 
polar,  106,  181 
spherical,  109 
toroidal,  186 
spherical,  133,  172 
toroidal,  126,  134 
Corner,  52 

Counterclockwise  direction,  69 
Critical  point,  35 
Cross  product,  21,  53,  64,  141 
properties  of,  75 
Cross  section,  114,  162,  193 
Cubical  grid,  168 
Curl,  64,  154 
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Curvature,  70,  74,  91 
absolute,  70 
centre  of,  72,  81 
circle  of,  72 

Gaussian,  207-215,  221 
constant,  209 
negative,  225 
zero,  197 
geodesic 

absolute,  219 
constant,  226 
zero,  219,  220 

geometric  significance  of,  7 1 
intrinsic,  218 
line  of,  202 
mean,  203,  214 
normal,  194,  213,  218 
principal,  195,  207 
radius  of,  72 
total,  224 
Curve 

closed,  48 
directed,  47 

piecewise  smooth,  52 
oriented,  47 
parametrized,  49 

Cylinder,  9,  27,  50,  81,  108,  123,  124,  150, 
159,  173,  175,  176,  191,  196 

D 

Degenerate  critical  points,  40 
Derivative,  5 

directional,  7,  111 
partial,  1,  7,  35,  98,  187 
first-order,  7 
higher-order,  8 

Determinant,  14,  26,  38,  75,  212,  213 
Differentiable  function,  4 
Differentiation 
implicit,  18 

Directed  curve,  47,  219 
piecewise  smooth,  52 
Directed  interval,  97 
Distance,  49,  219 
Divergence,  65,  179 
Divergence  theorem,  180-190 
Dot  product,  2 
Double  integral,  93 

E 

Edge,  222 
Eigenspace,  38 


Eigenvalue,  12,  38,  197,  213 
Eigenvector,  12,  38,  197,  213 
Ellipse,  23,27,  73,80,  113 
Ellipsoid,  23,  33,  109,  196,  225 
parametrization  of,  109 
Elliptic  point,  194 
Equator,  222 
Euler’s  formula,  213 
Euler-Poincare  characteristic,  223-226 
Extremal  value,  55 

F 

Face,  223 

Final  point,  47,  52,  219 
Flat  spot,  195,  207 
Frenet  approximation,  85 
Frenet-Serret  apparatus,  78,  85,  86,  90,  91 
Frenet-Serret  equations,  78,  83,  84,  86,  87, 
89,91,217 

Fubini’s  theorem,  95,  161 
Full  rank,  14,  38 
Function 

continuous,  3 
differentiable,  4 
integrable,  93 
Riemann  integrable,  161 
scalar-valued,  138 
smooth,  48 

Fundamental  theorem  of  calculus,  59,  97, 
149, 179 

G 

Gauss  map,  209 

Gauss’  divergence  theorem,  180 
Gauss-Bonnet  formula,  224 
Gauss-Bonnet  theorem,  222 
Gaussian  curvature,  195,  207,  221 
Generalised  helix,  87,  92 
Genus,  225 
Geodesic,  220,  226 
Geodesic  curvature,  217,  218 
Geodesic  torsion,  218,  226 
Geodesic  triangle,  222 
Geometric  mean,  28 
Gradient,  5,  19 

Graph,  8,  15,  94,  105,  120,  193 
Green’s  theorem,  98,  141,  162,  179 

H 

Hanging  chain,  203 
Harmonic  function,  67,  101 


Index 


255 


Helicoid,  131 
Helicoloid,  204 
Helix,  50,  53,78,81 
Hemisphere,  105,  159 
Hessian,  35,  197 
Hilbert’s  Lemma,  209 
Hyperbola,  23 
Hyperbolic  point,  195 

I 

Implicit  differentiation,  1 8 
Implicit  function  theorem,  9,  15,  119 
Independent  variables,  13 
Index,  225 

Initial  point,  47,  52,  91,  219 
Inner  normal,  73 
Inner  product,  2,  53,  77 
Integrable  function,  93 
Integral 

double,  93 
line,  56 
triple,  167 
Interior  angle,  222 

Intermediate  value  theorem,  43,  83,  87 
International  date  line,  115 
Inverse  mapping  theorem,  119 

J 

Jacobian,  171 

L 

Lagrange  multipliers,  26-33,  36,  200 
Lambert’s  equal  area  projection,  123 
Laplacian,  67 
Latitude 

parallels  of,  115 
Length,  2,  49,  53,  221 
Length  function,  5 1 
Level  set,  8,  15,  109 
Line 

normal,  19 
tangent,  19 
Line  integral,  150 
Line  of  curvature,  202 
Linear  approximation,  5,  13,  19 
Linear  mapping,  2 
Linearly  independent  solutions,  13 
Lines  of  longitude,  115 
Local  maximum,  4 
Longitude 
lines  of,  115 


M 

Matrix,  3 

Matrix  of  coefficients,  14 
Maximum,  4 
global,  43 
local,  4,  25,  36 
strict,  4,  36,  39 

Maximum  existence  theorem,  4 
Maximum  rank,  14 
Mean  value  theorem,  124 
Mercator’s  projection,  115,  123 
Meridian,  115,  123 
Minimal  surface,  203 
Minimum,  4 
local,  4,  36 

strict,  4,  36,  39 

Minimum  existence  theorem,  4 
Moving  frame,  76 
Multiplicity,  38 


N 

Negative  Gaussian  curvature,  225 
Non-degenerate  critical  points,  40 
Non-umbilic  points,  202 
Norm,  2 

Normal,  111,  136,  141,  150,  180 
unit,  69,  73,  146 
Normal  curvature,  194,  213 
Normal  form,  20 
Normal  line,  19 
Normal  plane,  19,  92 
Normal  section,  194 


O 

Open  set,  3 
Open  set  of  type  I,  94 
Open  set  of  type  II,  95 
Oriented  curve,  47 
Oriented  surface,  136 
Orthonormal  basis,  76,  213,  217 
Osculating  plane,  74,  81,  85 
Outer  normal,  184 


P 

Paraboloid,  23, 127, 128, 131, 165,  177,  191 
solid,  173 

Parallelepiped,  136,  168,  174 
Parallels  of  latitude,  115 
Parametric  form,  20 
Parametrization,  47,  169 
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unit  speed,  52 

Parametrization  of  cone,  133 
Parametrization  of  cylinder,  133 
Parametrization  of  ellipsoid,  173 
Parametrization  of  paraboloid,  127 
Parametrization  of  solids,  133,  172 
Parametrization  of  sphere,  133,  172 
Parametrization  of  surface  of  revolution,  134 
Parametrization  of  torus,  125,  134 
Parametrized  curve,  49 
Parametrized  surface,  104 
Partial  derivative,  1,  7 
Partition,  52,  93 
Pentagon,  33 

Perpendicular  vectors,  2,  21,  69,  111,  197 
Plane 

normal,  19 
tangent,  19 

Polar  coordinates,  175,  181 
Polygon,  43 
Positive  interval,  135 
Potential 
scalar,  59 
vector,  65 

Principal  curvature,  195,  207 

Principal  curvature  direction,  195,  197,  207 

Principal  minor,  38 

Profile  curve,  108 

Projection,  145,  170 

Pythagoras’  theorem,  77,  86,  218 

Q 

Quadratic  approximation,  72 

R 

Radius  of  curvature,  72 
Rank 

full,  14,  25 
maximum,  14,  26 
Rectangle,  33,  93,  121,  224 
Riemann  sum,  56,  93,  121,  137,  161,  169 
Riemannian  geometry,  208 
Right-hand  rule,  74,  141,  217 

S 

Saddle  point,  35,  36,  39,  196 

Scalar  field,  59 

Scalar  product,  2 

Scalar  valued  function,  138 

Set 

bounded,  3 


closed,  3 
compact,  3,  28 
connected,  60 
convex,  43 
open,  3 
Shape,  123 
Shape  operator,  211 
Shortest  path,  219 
Smooth  function,  48 
Solid,  172 

Solid  cylinder,  173,  180 
Solid  ellipsoid,  173 
Solid  of  revolution,  173 
Solid  sphere,  181 
Solid  torus,  173,  185 
Speed,  49 

Sphere,  3, 9,  85, 92, 108, 128, 138, 172, 209, 
224 
solid,  172 

Sphere  of  closest  fit,  208 
Spiral  staircase,  155 
Square  filling  curve,  124 
Stokes’  Theorem,  149,  179 
Surface,  119 
connected,  219 
minimal,  203,  205 
normal,  217 
of  revolution,  105 
orientable,  138 
oriented,  136 
parametrized,  104 
positive  side  of,  136 
simple,  104 

Surface  area,  29,  121,  122,  201 
Surface  of  revolution,  131,  134 
Surface  tension,  203 
Symmetric  matrix,  35 
Symmetric  operator,  212 
System  of  equations,  13 


T 

Tangent 

space,  19,  211 
unit,  58 

Tangent  line,  19 
Tangent  plane,  19,  137,  193 
Tangent  space,  111 
Tangent  vector,  111 
Taylor  series,  84 
Tetrahedron,  178 
Theorem 

Gauss  divergence,  180 
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Gauss-Bonnet,  222 
Green’s,  98 
implicit  function,  119 
intermediate  value,  43 
Inverse  Mapping,  119 
maximum  existence,  4 
minimum  existence,  4 
Pythagoras,  77,  86,  218 
Theorema  Egregium,  208 
Torsion,  74,  91 

geometric  significance  of,  83-86 
zero,  83 

Torus,  125, 159, 173, 185, 198,209,214,227 
parametrization  of,  185 
Triangle,  33,  53,  75,  145,  222 
Triple  integral,  162 


Unit  tangent,  221 
Unit  vector  basis,  2 
Upper  half-plane,  43 


V 

Vector  held,  55,  138,  225 
conservative,  59 
continuous,  55 
smooth,  55 
Velocity,  49 
Vertex,  223 

Volume,  29,  94,  137,  161 
Volume  of  torus,  173 


U 

Umbilic  point,  195,  196,  207 
Uniform  continuity,  161 
Unit  normal,  184,  193 
Unit  speed  parametrization,  47,  69 


W 

Weingarten  mapping,  211,  214 


Z 

Zero  torsion,  226 


